SEQUENCE LISTING 

<110> Zur Megede, Jan
      Barnett, Susan W.
      Engelbrecht, Susan
      van Rensburg, Estrelita Janse

<120> POLYNUCLEOTIDES ENCODING ANTIGENIC HIV TYPE C 
POLYPEPTIDES, POLYPEPTIDES AND USES THEREOF 

<130> PP01631.102 (CHIR-1631/03US) 

<140> 09/899,575 
<141> 2001-07-05 

<150> 09/610,313 
<151> 2000-07-05 

<160> 143 

<170> PatentIn Ver. 2.0

<210> 1 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 1 

gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 

<210> 2 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 2 

gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 

<210> 3 
<211> 1479 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag 
of HIV strain AF110965 

<400> 3 

atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acgcctggga gcgcatccgc 60 
ctgcgccccg gcggcaagaa gtgctacatg atgaagcacc tggtgtgggc cagccgcgag 120 
ctggagaagt tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 180 
atccgccagc tgcaccccgc cctgcagacc ggcagcgagg agctgaagag cctgttcaac 240
accgtggcca ccctgtactg cgtgcacgag aagatcgagg tccgcgacac caaggaggcc 300
ctggacaaga tcgaggagga gcagaacaag tgccagcaga agatccagca ggccgaggcc 360
gccgacaagg gcaaggtgag ccagaactac cccatcgtgc agaacctgca gggccagatg 420 
gtgcaccagg ccatcagccc ccgcaccctg aacgcctggg tgaaggtgat cgaggagaag 480 
gccttcagcc ccgaggtgat ccccatgttc accgccctga gcgagggcgc caccccccag 540 







gacctgaaca cgatgttgaa caccgtgggc ggccaccagg ccgccatgca gatgctgaag 600 
gacaccatca acgaggaggc cgccgagtgg gaccgcgtgc accccgtgca cgccggcccc 660 
atcgcccccg gccagatgcg cgagccccgc ggcagcgaca tcgccggcac caccagcacc 720
ctgcaggagc agatcgcctg gatgaccagc aaccccccca tccccgtggg cgacatctac 780
aagcggtgga tcatcctggg cctgaacaag atcgtgcgga tgtacagccc cgtgagcatc 840
ctggacatca agcagggccc caaggagccc ttccgcgact acgtggaccg cttcttcaag 900 
accctgcgcg ccgagcagag cacccaggag gtgaagaact ggatgaccga caccctgctg 960 
gtgcagaacg ccaaccccga ctgcaagacc atcctgcgcg ctctcggccc cggcgccagc 1020 



ctggaggaga tgatgaccgc ctgccagggc gtgggcggcc ccagccacaa ggcccgcgtg 1080
ctggccgagg cgatgagcca ggccaacacc agcgtgatga tgcagaagag caacttcaag 1140
ggcccccggc gcatcgtcaa gtgcttcaac tgcggcaagg agggccacat cgcccgcaac 1200
tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320
cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccgccga gagcttccgc 1380
ttcgaggaga ccacccccgg ccagaagcag gagagcaagg accgcgagac cctgaccagc 1440
ctgaagagcc tgttcggcaa cgaccccctg agccagtaa 1479



<210> 4 
<211> 1509
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag
of HIV strain AF110967

<400> 4

atgggcgccc gcgccagcat cctgcgcggc gagaagctgg acaagtggga S^agatccgc 60 
ctlScLcg icggcaagaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120 
ctggagggct tcgccctgaa ccccggcctg ctggagaccg ccgagggctg caagcagatc 180 
atlLiSgc tglagcccgc cctgcagacc ggcaccgagg agctgcgcag ^ctgtacaac 240 
accgtlgcca ccctgtactg cgtgcacgcc ggcatcgagg tccgcgacac caaggaggcc 300 
ctggacaaga tcgaggagga gcagaacaag tcccagcaga agacccagca ggccaaggag 360 
gcSacggca ag|tSgcca gaactacccc atcgtgcaga acctgcaggg ccagatggtg 420 
ScSScca tclglccccg caccctgaac gcctgggtga aggtgatcga ggagaaggcc 480 
ttcagccccg agglgatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540 
cSaacacga t^tiaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600 
accatcaacg aggaggccgc cgagtgggac cgcctgcacc ccgtgcaggc ^^^^^^^^^ 
gcccccggcl agatgcgcga cccccgcggc agcgacatcg ccggcgccac ^agcaccctg 720 
caggagcaga tcgcctggat gaccagcaac ccccccgtgc ccgtgggcga ^atctacaag 780 
cggtggatca tcctgggcct gaacaagatc gtgcggatgt acagccccgt gagcatcctg 840 
giSlcgcc agggc?Laa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900 
Itgcgcgccg agcaggccac ccaggacgtg aagaactgga tgaccgagac =«=t9=tggtg 960 
calalcicc! accccgactg caagaccatc ctgcgcgctc tcggccccgg ^gccaccctg 1020 
gaggagatga tgaccgcctg ccagggcgtg ggcggccccg gccacaaggc ccgcgtgctg 1080 
Iccgaggcia tgagccaggc caacagcgtg aacatcatga tgcagaagag caacttcaag 1140 
IgScSgIc gLLgtlL gtgcttcaac tgcggcaagg agggccacat cgocaagaac 1200 
Sccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 
glctLLcg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320 
cccScaacl tL?gca|aa ccgcagcgag cccgccgccc ccaccgtgcc caccgccccc 1380 
cccgccgaga gcttccgctt cgaggagacc acccccgccc ccaagcagga gcccaaggac 1440 
cgciaglclt Lcgcgagcc cctgaccgcc ctgcgcagcc tgttcggcag cggccccctg 1500 



agccagtaa 1509

<210> 5 
<211> 141 
<212> DNA 






<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Env common 
region of HIV strain AF110968 

<400> 5 

accatcacca tcacctgccg catcaagcag atcatcaaca tgtggcagaa ggtgggccgc 60 
gccatgtacg ccccccccat cgccggcaac ctgacctgcg agagcaacat caccggcctg 120 
ctgctgaccc gcgacggcgg c 141 

<210> 6 
<211> 1431 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
gp120 coding region of HIV strain AF110968



<400> 6 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60
aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360
aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420
gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480
gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540
ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600
aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660
ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720
atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780
aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840
ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900
tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960
gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020
accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080
aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140
atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200
gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260
cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320
atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380
ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg c 1431



<210> 7 
<211> 1944 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic
gp140 coding region of HIV strain AF110968

<400> 7 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60 






aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 120 
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180 
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 
atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300 
ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360 
aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420 
gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480 
gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540 
ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600 
aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660 
ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720 
atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780
aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840 
ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900 
tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960 
gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020 
accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080 
aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140 
atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200 
gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260 
cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320 
atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380 
ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg cgccgtgggc 1440 
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1500 
atcaccctga ccgtgcaggc ccgcctgctg ctgagcggca tcgtgcagca gcagaacaac 1560 
ctgctgcgcg ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag 1620 
cagctgcaga cccgcatcct ggccgtggag cgctacctga aggaccagca gctgctgggc 1680 
atctggggct gcagcggcaa gctgatctgc accaccgccg tgccctggaa cagcagctgg 1740 
agcaaccgca gccacgacga gatctgggac aacatgacct ggatgcagtg ggaccgcgag 1800 
atcaacaact acaccgacac catctaccgc ctgctggagg agagccagaa ccagcaggag 1860 
aagaacgaga aggacctgct ggccctggac agctggcaga acctgtggaa ctggttcagc 1920 
atcaccaact ggctgtggta catc 1944 

<210> 8 
<211> 2466 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
gp160 coding region of HIV strain AF110968

<400> 8 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60
aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360
aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420
gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480
gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540
ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600
aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660
ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720
atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780
aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840
ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900
tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960
gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020
accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080
aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140
atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200
gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260
cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320
atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380
ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg cgccgtgggc 1440
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1500
atcaccctga ccgtgcaggc ccgcctgctg ctgagcggca tcgtgcagca gcagaacaac 1560
ctgctgcgcg ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag 1620
cagctgcaga cccgcatcct ggccgtggag cgctacctga aggaccagca gctgctgggc 1680
atctggggct gcagcggcaa gctgatctgc accaccgccg tgccctggaa cagcagctgg 1740
agcaaccgca gccacgacga gatctgggac aacatgacct ggatgcagtg ggaccgcgag 1800
atcaacaact acaccgacac catctaccgc ctgctggagg agagccagaa ccagcaggag 1860
aagaacgaga aggacctgct ggccctggac agctggcaga acctgtggaa ctggttcagc 1920
atcaccaact ggctgtggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 1980
ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2040
ctgcccttcc agaccctgac ccccaacccc cgcgagcccg accgcctggg ccgcatcgag 2100
gaggagggcg gcgagcagga ccgcggccgc agcatccgcc tggtgagcgg cttcctggcc 2160
ctggcctggg acgacctgcg cagcctgtgc ctgttcagct accaccgcct gcgcgacttc 2220
atcctgatcg ccgcccgcgt gctggagctg ctgggccagc gcggctggga ggccctgaag 2280
tacctgggca gcctggtgca gtactggggc ctggagctga agaagagcgc catcagcctg 2340
ctggacacca tcgccatcgc cgtggccgag ggcaccgacc gcatcatcga gttcatccag 2400
cgcatctgcc gcgccatccg caacatcccc cgccgcatcc gccagggctt cgaggccgcc 2460
ctgcag 2466

<210> 9
<211> 2547
<212> DNA

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: synthetic
signal sequence and gp160 coding region of HIV
strain AF110968



<400> 9 

atgcgcgtga tgggcatcct gaagaactac cagcagtggt ggatgtgggg catcctgggc 60
ttctggatgc tgatcatcag cagcgtggtg ggcaacctgt gggtgaccgt gtactacggc 120
gtgcccgtgt ggaaggaggc caagaccacc ctgttctgca ccagcgacgc caaggcctac 180
gagaccgagg tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc 240
caggagatcg tgctggagaa cgtgaccgag aacttcaaca tgtggaagaa cgacatggtg 300
gaccagatgc acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag 360
ctgacccccc tgtgcgtgac cctgaagtgc cgcaacgtga acgccaccaa caacatcaac 420
agcatgatcg acaacagcaa caagggcgag atgaagaact gcagcttcaa cgtgaccacc 480
gagctgcgcg accgcaagca ggaggtgcac gccctgttct accgcctgga cgtggtgccc 540
ctgcagggca acaacagcaa cgagtaccgc ctgatcaact gcaacaccag cgccatcacc 600
caggcctgcc ccaaggtgag cttcgacccc atccccatcc actactgcac ccccgccggc 660
tacgccatcc tgaagtgcaa caaccagacc ttcaacggca ccggcccctg caacaacgtg 720
agcagcgtgc agtgcgccca cggcatcaag cccgtggtga gcacccagct gctgctgaac 780
ggcagcctgg ccaagggcga gatcatcatc cgcagcgaga acctggccaa caacgccaag 840
atcatcatcg tgcagctgaa caagcccgtg aagatcgtgt gcgtgcgccc caacaacaac 900
acccgcaaga gcgtgcgcat cggccccggc cagaccttct acgccaccgg cgagatcatc 960
ggcgacatcc gccaggccta ctgcatcatc aacaagaccg agtggaacag caccctgcag 1020
ggcgtgagca agaagctgga ggagcacttc agcaagaagg ccatcaagtt cgagcccagc 1080
agcggcggcg acctggagat caccacccac agcttcaact gccgcggcga gttcttctac 1140
tgcgacacca gccagctgtt caacagcacc tacagcccca gcttcaacgg caccgagaac 1200
aagctgaacg gcaccatcac catcacctgc cgcatcaagc agatcatcaa catgtggcag 1260
aaggtgggcc gcgccatgta cgcccccccc atcgccggca acctgacctg cgagagcaac 1320
atcaccggcc tgctgctgac ccgcgacggc ggcaagaccg gccccaacga caccgagatc 1380
ttccgccccg gcggcggcga catgcgcgac aactggcgca acgagctgta caagtacaag 1440
gtggtggaga tcaagcccct gggcgtggcc cccaccgagg ccaagcgccg cgtggtggag 1500
cgcgagaagc gcgccgtggg catcggcgcc gtgttcctgg gcttcctggg cgccgccggc 1560
agcaccatgg gcgccgccag catcaccctg accgtgcagg cccgcctgct gctgagcggc 1620
atcgtgcagc agcagaacaa cctgctgcgc gccatcgagg cccagcagca cctgctgcag 1680
ctgaccgtgt ggggcatcaa gcagctgcag acccgcatcc tggccgtgga gcgctacctg 1740
aaggaccagc agctgctggg catctggggc tgcagcggca agctgatctg caccaccgcc 1800
gtgccctgga acagcagctg gagcaaccgc agccacgacg agatctggga caacatgacc 1860
tggatgcagt gggaccgcga gatcaacaac tacaccgaca ccatctaccg cctgctggag 1920
gagagccaga accagcagga gaagaacgag aaggacctgc tggccctgga cagctggcag 1980
aacctgtgga actggttcag catcaccaac tggctgtggt acatcaagat cttcatcatg 2040
atcgtgggcg gcctgatcgg cctgcgcatc atcttcgccg tgctgagcat cgtgaaccgc 2100
gtgcgccagg gctacagccc cctgcccttc cagaccctga cccccaaccc ccgcgagccc 2160
gaccgcctgg gccgcatcga ggaggagggc ggcgagcagg accgcggccg cagcatccgc 2220
ctggtgagcg gcttcctggc cctggcctgg gacgacctgc gcagcctgtg cctgttcagc 2280
taccaccgcc tgcgcgactt catcctgatc gccgcccgcg tgctggagct gctgggccag 2340
cgcggctggg aggccctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2400
aagaagagcg ccatcagcct gctggacacc atcgccatcg ccgtggccga gggcaccgac 2460
cgcatcatcg agttcatcca gcgcatctgc cgcgccatcc gcaacatccc ccgccgcatc 2520
cgccagggct tcgaggccgc cctgcag 2547

<210> 10
<211> 1035
<212> DNA

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: synthetic a
gp41 coding region of HIV strain AF110968

<400> 10 

gccgtgggca tcggcgccgt gttcctgggc ttcctgggcg ccgccggcag caccatgggc 60 
gccgccagca tcaccctgac cgtgcaggcc cgcctgctgc tgagcggcat cgtgcagcag 120 
cagaacaacc tgctgcgcgc catcgaggcc cagcagcacc tgctgcagct gaccgtgtgg 180 
ggcatcaagc agctgcagac ccgcatcctg gccgtggagc gctacctgaa ggaccagcag 240 
ctgctgggca tctggggctg cagcggcaag ctgatctgca ccaccgccgt gccctggaac 300 
agcagctgga gcaaccgcag ccacgacgag atctgggaca acatgacctg gatgcagtgg 360 
gaccgcgaga tcaacaacta caccgacacc atctaccgcc tgctggagga gagccagaac 420 
cagcaggaga agaacgagaa ggacctgctg gccctggaca gctggcagaa cctgtggaac 480 
tggttcagca tcaccaactg gctgtggtac atcaagatct tcatcatgat cgtgggcggc 540 
ctgatcggcc tgcgcatcat cttcgccgtg ctgagcatcg tgaaccgcgt gcgccagggc 600 
tacagccccc tgcccttcca gaccctgacc cccaaccccc gcgagcccga ccgcctgggc 660 
cgcatcgagg aggagggcgg cgagcaggac cgcggccgca gcatccgcct ggtgagcggc 720 
ttcctggccc tggcctggga cgacctgcgc agcctgtgcc tgttcagcta ccaccgcctg 780 
cgcgacttca tcctgatcgc cgcccgcgtg ctggagctgc tgggccagcg cggctgggag 840 
gccctgaagt acctgggcag cctggtgcag tactggggcc tggagctgaa gaagagcgcc 900 
atcagcctgc tggacaccat cgccatcgcc gtggccgagg gcaccgaccg catcatcgag 960
ttcatccagc gcatctgccg cgccatccgc aacatccccc gccgcatccg ccagggcttc 1020 
gaggccgccc tgcag 1035 



<210> 11 



<211> 144 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Env 
common region of HIV strain AF110975 

<400> 11 

agcatcatca ccctgccctg ccgcatcaag cagatcatcg acatgtggca gaaggtgggc 60
cgcgccatct acgccccccc catcgagggc aacatcacct gcagcagcag catcaccggc 120 
ctgctgctgg cccgcgacgg cggc 144 

<210> 12 
<211> 1437 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
gp120 coding region of HIV strain AF110975

<400> 12 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60
agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 360
aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420
aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480
agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600
tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660
acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720
ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780
cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840
atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900
caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960
aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020
ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080
aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140
atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200
gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260
ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320
ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380
cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgc 1437

<210> 13
<211> 1950
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: synthetic
gp140 coding region of HIV strain AF110975

<400> 13

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60 
agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120 
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180 
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240 
atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 300 
ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 360 
aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420 
aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480 
agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540 
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600 
tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660 
acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720 
ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780 
cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840 
atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900 
caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960 
aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020 
ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080 
aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140 
atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200 
gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260 
ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320 
ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380 
cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgcgcc 1440 
gtgggcatcg gcgccgtgat cttcggcttc ctgggcgccg ccggcagcaa catgggcgcc 1500 
gccagcatca ccctgaccgc ccaggcccgc cagctgctga gcggcatcgt gcagcagcag 1560 
agcaacctgc tgcgcgccat cgaggcccag cagcacatgc tgcagctgac cgtgtggggc 1620 
atcaagcagc tgcaggcccg cgtgctggcc atcgagcgct acctgaagga ccagcagctg 1680 
ctgggcatct ggggctgcag cggcaagctg atctgcacca ccaccgtgcc ctggaacagc 1740 
agctggagca acaagaccca gggcgagatc tgggagaaca tgacctggat gcagtgggac 1800 
aaggagatca gcaactacac cggcatcatc taccgcctgc tggaggagag ccagaaccag 1860 
caggagcaga acgagaagga cctgctggcc ctggacagcc gcaacaacct gtggagctgg 1920 
ttcaacatca gcaactggct gtggtacatc 1950 

<210> 14 
<211> 2493 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
gp160 coding region of HIV strain AF110975

<400> 14 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60
agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 360
aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420
aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480
agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600
tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660
acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720
ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780
cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840 
atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900 
caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960 
aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020 
ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080 
aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140 
atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200 
gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260 
ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320 
ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380 
cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgcgcc 1440 
gtgggcatcg gcgccgtgat cttcggcttc ctgggcgccg ccggcagcaa catgggcgcc 1500 
gccagcatca ccctgaccgc ccaggcccgc cagctgctga gcggcatcgt gcagcagcag 1560 
agcaacctgc tgcgcgccat cgaggcccag cagcacatgc tgcagctgac cgtgtggggc 1620 
atcaagcagc tgcaggcccg cgtgctggcc atcgagcgct acctgaagga ccagcagctg 1680 
ctgggcatct ggggctgcag cggcaagctg atctgcacca ccaccgtgcc ctggaacagc 1740 
agctggagca acaagaccca gggcgagatc tgggagaaca tgacctggat gcagtgggac 1800 
aaggagatca gcaactacac cggcatcatc taccgcctgc tggaggagag ccagaaccag 1860 
caggagcaga acgagaagga cctgctggcc ctggacagcc gcaacaacct gtggagctgg 1920 
ttcaacatca gcaactggct gtggtacatc aagatcttca tcatgatcgt gggcggcctg 1980 
atcggcctgc gcatcatctt cgccgtgctg agcatcgtga accgcgtgcg ccagggctac 2040 
agccccctga gcttccagac cctgaccccc aacccccgcg gcctggaccg cctgggccgc 2100 
atcgaggagg agggcggcga gcaggaccgc gaccgcagca tccgcctggt gcagggcttc 2160 
ctggccctgg cctgggacga cctgcgcagc ctgtgcctgt tcagctacca ccgcctgcgc 2220 
gacctgatcc tggtgaccgc ccgcgtggtg gagctgctgg gccgcagcag cccccgcggc 2280 
ctgcagcgcg gctgggaggc cctgaagtac ctgggcagcc tggtgcagta ctggggcctg 2340 
gagctgaaga agagcgccac cagcctgctg gacagcatcg ccatcgccgt ggccgagggc 2400 
accgaccgca tcatcgaggt gatccagcgc atctaccgcg ccttctgcaa catcccccgc 2460 
cgcgtgcgcc agggcttcga ggccgccctg cag 2493 

<210> 15 
<211> 2565 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic
signal sequence and gp160 coding region of HIV
strain AF110975

<400> 15 

atgcgcgtgc gcggcatcct gcgcagctgg cagcagtggt ggatctgggg catcctgggc 60
ttctggatct gcagcggcct gggcaacctg tgggtgaccg tgtacgacgg cgtgcccgtg 120
tggcgcgagg ccagcaccac cctgttctgc gccagcgacg ccaaggccta cgagaaggag 180
gtgcacaacg tgtgggccac ccacgcctgc gtgcccaccg accccaaccc ccaggagatc 240
gagctggaca acgtgaccga gaacttcaac atgtggaaga acgacatggt ggaccagatg 300
cacgaggaca tcatcagcct gtgggaccag agcctgaagc cccgcgtgaa gctgaccccc 360
ctgtgcgtga ccctgaagtg caccaactac agcaccaact acagcaacac catgaacgcc 420
accagctaca acaacaacac caccgaggag atcaagaact gcaccttcaa catgaccacc 480
gagctgcgcg acaagaagca gcaggtgtac gccctgttct acaagctgga catcgtgccc 540
ctgaacagca acagcagcga gtaccgcctg atcaactgca acaccagcgc catcacccag 600
gcctgcccca aggtgagctt cgaccccatc cccatccact actgcgcccc cgccggctac 660
gccatcctga agtgcaagaa caacaccagc aacggcaccg gcccctgcca gaacgtgagc 720
accgtgcagt gcacccacgg catcaagccc gtggtgagca cccccctgct gctgaacggc 780
agcctggccg agggcggcga gatcatcatc cgcagcaaga acctgagcaa caacgcctac 840
accatcatcg tgcacctgaa cgacagcgtg gagatcgtgt gcacccgccc caacaacaac 900
acccgcaagg gcatccgcat cggccccggc cagaccttct acgccaccga gaacatcatc 960
ggcgacatcc gccaggccca ctgcaacatc agcgccggcg agtggaacaa ggccgtgcag 1020 
cgcgtgagcg ccaagctgcg cgagcacttc cccaacaaga ccatcgagtt ccagcccagc 1080 
agcggcggcg acctggagat caccacccac agcttcaact gccgcggcga gttcttctac 1140 
tgcaacacca gcaagctgtt caacagcagc tacaacggca ccagctaccg cggcaccgag 1200 
agcaacagca gcatcatcac cctgccctgc cgcatcaagc agatcatcga catgtggcag 1260 
aaggtgggcc gcgccatcta cgcccccccc atcgagggca acatcacctg cagcagcagc 1320 
atcaccggcc tgctgctggc ccgcgacggc ggcctggaca acatcaccac cgagatcttc 1380 
cgcccccagg gcggcgacat gaaggacaac tggcgcaacg agctgtacaa gtacaaggtg 1440 
gtggagatca agcccctggg cgtggccccc accgaggcca agcgccgcgt ggtggagcgc 1500 
gagaagcgcg ccgtgggcat cggcgccgtg atcttcggct tcctgggcgc cgccggcagc 1560 
aacatgggcg ccgccagcat caccctgacc gcccaggccc gccagctgct gagcggcatc 1620 
gtgcagcagc agagcaacct gctgcgcgcc atcgaggccc agcagcacat gctgcagctg 1680 
accgtgtggg gcatcaagca gctgcaggcc cgcgtgctgg ccatcgagcg ctacctgaag 1740 
gaccagcagc tgctgggcat ctggggctgc agcggcaagc tgatctgcac caccaccgtg 1800 
ccctggaaca gcagctggag caacaagacc cagggcgaga tctgggagaa catgacctgg 1860 
atgcagtggg acaaggagat cagcaactac accggcatca tctaccgcct gctggaggag 1920 
agccagaacc agcaggagca gaacgagaag gacctgctgg ccctggacag ccgcaacaac 1980 
ctgtggagct ggttcaacat cagcaactgg ctgtggtaca tcaagatctt catcatgatc 2040 
gtgggcggcc tgatcggcct gcgcatcatc ttcgccgtgc tgagcatcgt gaaccgcgtg 2100 
cgccagggct acagccccct gagcttccag accctgaccc ccaacccccg cggcctggac 2160 
cgcctgggcc gcatcgagga ggagggcggc gagcaggacc gcgaccgcag catccgcctg 2220 
gtgcagggct tcctggccct ggcctgggac gacctgcgca gcctgtgcct gttcagctac 2280 
caccgcctgc gcgacctgat cctggtgacc gcccgcgtgg tggagctgct gggccgcagc 2340 
agcccccgcg gcctgcagcg cggctgggag gccctgaagt acctgggcag cctggtgcag 2400 
tactggggcc tggagctgaa gaagagcgcc accagcctgc tggacagcat cgccatcgcc 2460 
gtggccgagg gcaccgaccg catcatcgag gtgatccagc gcatctaccg cgccttctgc 2520 
aacatccccc gccgcgtgcg ccagggcttc gaggccgccc tgcag 2565 

<210> 16 
<211> 1056 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic a 
gp41 coding region of HIV strain AF110975 

<400> 16 

gccgtgggca tcggcgccgt gatcttcggc ttcctgggcg ccgccggcag caacatgggc 60
gccgccagca tcaccctgac cgcccaggcc cgccagctgc tgagcggcat cgtgcagcag 120
cagagcaacc tgctgcgcgc catcgaggcc cagcagcaca tgctgcagct gaccgtgtgg 180
ggcatcaagc agctgcaggc ccgcgtgctg gccatcgagc gctacctgaa ggaccagcag 240
ctgctgggca tctggggctg cagcggcaag ctgatctgca ccaccaccgt gccctggaac 300
agcagctgga gcaacaagac ccagggcgag atctgggaga acatgacctg gatgcagtgg 360
gacaaggaga tcagcaacta caccggcatc atctaccgcc tgctggagga gagccagaac 420
cagcaggagc agaacgagaa ggacctgctg gccctggaca gccgcaacaa cctgtggagc 480
tggttcaaca tcagcaactg gctgtggtac atcaagatct tcatcatgat cgtgggcggc 540
ctgatcggcc tgcgcatcat cttcgccgtg ctgagcatcg tgaaccgcgt gcgccagggc 600
tacagccccc tgagcttcca gaccctgacc cccaaccccc gcggcctgga ccgcctgggc 660
cgcatcgagg aggagggcgg cgagcaggac cgcgaccgca gcatccgcct ggtgcagggc 720
ttcctggccc tggcctggga cgacctgcgc agcctgtgcc tgttcagcta ccaccgcctg 780
cgcgacctga tcctggtgac cgcccgcgtg gtggagctgc tgggccgcag cagcccccgc 840
ggcctgcagc gcggctggga ggccctgaag tacctgggca gcctggtgca gtactggggc 900
ctggagctga agaagagcgc caccagcctg ctggacagca tcgccatcgc cgtggccgag 960
ggcaccgacc gcatcatcga ggtgatccag cgcatctacc gcgccttctg caacatcccc 1020
cgccgcgtgc gccagggctt cgaggccgcc ctgcag 1056

<210> 17
<211> 492
<212> PRT

<400> 17

Met Gly Ala Arg Ala Ser Ile Leu Arg Gly Gly Lys Leu Asp Ala Trp
1 5 10 15

Glu Arg Ile Arg Leu Arg Pro Gly Gly Lys Lys Cys Tyr Met Met Lys
20 25 30

His Leu Val Trp Ala Ser Arg Glu Leu Glu Lys Phe Ala Leu Asn Pro
35 40 45

Gly Leu Leu Glu Thr Ser Glu Gly Cys Lys Gln Ile Ile Arg Gln Leu
50 55 60

His Pro Ala Leu Gln Thr Gly Ser Glu Glu Leu Lys Ser Leu Phe Asn
65 70 75 80

Thr Val Ala Thr Leu Tyr Cys Val His Glu Lys Ile Glu Val Arg Asp
85 90 95

Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Cys Gln
100 105 110

Gln Lys Ile Gln Gln Ala Glu Ala Ala Asp Lys Gly Lys Val Ser Gln
115 120 125

Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His Gln Ala
130 135 140

Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Ile Glu Glu Lys
145 150 155 160

Ala Phe Ser Pro Glu Val Ile Pro Met Phe Thr Ala Leu Ser Glu Gly
165 170 175

Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His
180 185 190

Gln Ala Ala Met Gln Met Leu Lys Asp Thr Ile Asn Glu Glu Ala Ala
195 200 205

Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro Ile Ala Pro Gly
210 215 220

Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr Ser Thr
225 230 235 240

Leu Gln Glu Gln Ile Ala Trp Met Thr Ser Asn Pro Pro Ile Pro Val
245 250 255

Gly Asp Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys Ile Val
260 265 270

Arg Met Tyr Ser Pro Val Ser Ile Leu Asp Ile Lys Gln Gly Pro Lys
275 280 285

Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Thr Leu Arg Ala
290 295 300

Glu Gln Ser Thr Gln Glu Val Lys Asn Trp Met Thr Asp Thr Leu Leu
305 310 315 320

Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Arg Ala Leu Gly
325 330 335

Pro Gly Ala Ser Leu Glu Glu Met Met Thr Ala Cys Gln Gly Val Gly
340 345 350

Gly Pro Ser His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gln Ala
355 360 365

Asn Thr Ser Val Met Met Gln Lys Ser Asn Phe Lys Gly Pro Arg Arg
370 375 380

Ile Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Ile Ala Arg Asn
385 390 395 400

Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly
405 410 415

His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn Phe Leu Gly Lys
420 425 430

Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gln Ser Arg
435 440 445

Pro Glu Pro Thr Ala Pro Pro Ala Glu Ser Phe Arg Phe Glu Glu Thr
450 455 460

Thr Pro Gly Gln Lys Gln Glu Ser Lys Asp Arg Glu Thr Leu Thr Ser
465 470 475 480

Leu Lys Ser Leu Phe Gly Asn Asp Pro Leu Ser Gln
485 490



<210> 18 
<211> 81
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence of HIV strain AF110968 

<400> 18 

atgcgcgtga tgggcatcct gaagaactac cagcagtggt ggatgtgggg catcctgggc 60 
ttctggatgc tgatcatcag c 81 

<210> 19 
<211> 72
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence of HIV strain AF110975 

<400> 19 

atgcgcgtgc gcggcatcct gcgcagctgg cagcagtggt ggatctgggg catcctgggc 60 
ttctggatct gc 72 

<210> 20 
<211> 1479 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag 
coding sequence of HIV strain AF110965

<400> 20 

atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acgcctggga gcgcatccgc 60
ctgcgccccg gcggcaagaa gtgctacatg atgaagcacc tggtgtgggc cagccgcgag 120
ctggagaagt tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 180
atccgccagc tgcaccccgc cctgcagacc ggcagcgagg agctgaagag cctgttcaac 240
accgtggcca ccctgtactg cgtgcacgag aagatcgagg tgcgcgacac caaggaggcc 300
ctggacaaga tcgaggagga gcagaacaag tgccagcaga agatccagca ggccgaggcc 360
gccgacaagg gcaaggtgag ccagaactac cccatcgtgc agaacctgca gggccagatg 420
gtgcaccagg ccatcagccc ccgcaccctg aacgcctggg tgaaggtgat cgaggagaag 480
gccttcagcc ccgaggtgat ccccatgttc accgccctga gcgagggcgc caccccccag 540
gacctgaaca ccatgctgaa caccgtgggc ggccaccagg ccgccatgca gatgctgaag 600
gacaccatca acgaggaggc cgccgagtgg gaccgcgtgc accccgtgca cgccggcccc 660
atcgcccccg gccagatgcg cgagccccgc ggcagcgaca tcgccggcac caccagcacc 720
ctgcaggagc agatcgcctg gatgaccagc aaccccccca tccccgtggg cgacatctac 780
aagcgctgga tcatcctggg cctgaacaag atcgtgcgca tgtacagccc cgtgagcatc 840
ctggacatca agcagggccc caaggagccc ttccgcgact acgtggaccg cttcttcaag 900
accctgcgcg ccgagcagag cacccaggag gtgaagaact ggatgaccga caccctgctg 960
gtgcagaacg ccaaccccga ctgcaagacc atcctgcgcg ccctgggccc cggcgccagc 1020
ctggaggaga tgatgaccgc ctgccagggc gtgggcggcc ccagccacaa ggcccgcgtg 1080
ctggccgagg ccatgagcca ggccaacacc agcgtgatga tgcagaagag caacttcaag 1140
ggcccccgcc gcatcgtgaa gtgcttcaac tgcggcaagg agggccacat cgcccgcaac 1200
tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320
cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccgccga gagcttccgc 1380
ttcgaggaga ccacccccgg ccagaagcag gagagcaagg accgcgagac cctgaccagc 1440
ctgaagagcc tgttcggcaa cgaccccctg agccagtaa 1479

<210> 21
<211> 1509
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: synthetic Gag
coding sequence of HIV strain AF110967

<400> 21

atgggcgccc gcgccagcat cctgcgcggc gagaagctgg acaagtggga gaagatccgc 60 
ctgcgccccg gcggcaagaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120 
ctggagggct tcgccctgaa ccccggcctg ctggagaccg ccgagggctg caagcagatc 180 
atgaagcagc tgcagcccgc cctgcagacc ggcaccgagg agctgcgcag cctgtacaac 240 
accgtggcca ccctgtactg cgtgcacgcc ggcatcgagg tgcgcgacac caaggaggcc 300 
ctggacaaga tcgaggagga gcagaacaag agccagcaga agacccagca ggccaaggag 360 
gccgacggca aggtgagcca gaactacccc atcgtgcaga acctgcaggg ccagatggtg 420 
caccaggcca tcagcccccg caccctgaac gcctgggtga aggtgatcga ggagaaggcc 480 
ttcagccccg aggtgatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540 
ctgaacacca tgctgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600 
accatcaacg aggaggccgc cgagtgggac cgcctgcacc ccgtgcaggc cggccccgtg 660 
gcccccggcc agatgcgcga cccccgcggc agcgacatcg ccggcgccac cagcaccctg 720 
caggagcaga tcgcctggat gaccagcaac ccccccgtgc ccgtgggcga catctacaag 780 
cgctggatca tcctgggcct gaacaagatc gtgcgcatgt acagccccgt gagcatcctg 840 
gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900 
ctgcgcgccg agcaggccac ccaggacgtg aagaactgga tgaccgagac cctgctggtg 960 
cagaacgcca accccgactg caagaccatc ctgcgcgccc tgggccccgg cgccaccctg 1020 
gaggagatga tgaccgcctg ccagggcgtg ggcggccccg gccacaaggc ccgcgtgctg 1080 
gccgaggcca tgagccaggc caacagcgtg aacatcatga tgcagaagag caacttcaag 1140 
ggcccccgcc gcaacgtgaa gtgcttcaac tgcggcaagg agggccacat cgccaagaac 1200 
tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320 
cccggcaact tcctgcagaa ccgcagcgag cccgccgccc ccaccgtgcc caccgccccc 1380 
cccgccgaga gcttccgctt cgaggagacc acccccgccc ccaagcagga gcccaaggac 1440 
cgcgagccct accgcgagcc cctgaccgcc ctgcgcagcc tgttcggcag cggccccctg 1500 
agccagtaa 1509

<210> 22 
<211> 502 
<212> PRT 

<213> Human immunodeficiency virus 



<400> 22 

Met Gly Ala Arg Ala Ser Ile Leu Arg Gly Glu Lys Leu Asp Lys Trp
1 5 10 15

Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys His Tyr Met Leu Lys
20 25 30

His Leu Val Trp Ala Ser Arg Glu Leu Glu Gly Phe Ala Leu Asn Pro
35 40 45

Gly Leu Leu Glu Thr Ala Glu Gly Cys Lys Gln Ile Met Lys Gln Leu
50 55 60

Gln Pro Ala Leu Gln Thr Gly Thr Glu Glu Leu Arg Ser Leu Tyr Asn
65 70 75 80

Thr Val Ala Thr Leu Tyr Cys Val His Ala Gly Ile Glu Val Arg Asp
85 90 95

Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Gln
100 105 110

Gln Lys Thr Gln Gln Ala Lys Glu Ala Asp Gly Lys Val Ser Gln Asn
115 120 125

Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His Gln Ala Ile
130 135 140

Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Ile Glu Glu Lys Ala
145 150 155 160

Phe Ser Pro Glu Val Ile Pro Met Phe Thr Ala Leu Ser Glu Gly Ala
165 170 175

Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gln
180 185 190

Ala Ala Met Gln Met Leu Lys Asp Thr Ile Asn Glu Glu Ala Ala Glu
195 200 205

Trp Asp Arg Leu His Pro Val Gln Ala Gly Pro Val Ala Pro Gly Gln
210 215 220

Met Arg Asp Pro Arg Gly Ser Asp Ile Ala Gly Ala Thr Ser Thr Leu
225 230 235 240

Gln Glu Gln Ile Ala Trp Met Thr Ser Asn Pro Pro Val Pro Val Gly
245 250 255

Asp Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys Ile Val Arg
260 265 270

Met Tyr Ser Pro Val Ser Ile Leu Asp Ile Arg Gln Gly Pro Lys Glu
275 280 285

Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Thr Leu Arg Ala Glu
290 295 300

Gln Ala Thr Gln Asp Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val
305 310 315 320

Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Arg Ala Leu Gly Pro
325 330 335

Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly Val Gly Gly
340 345 350

Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gln Ala Asn
355 360 365

Ser Val Asn Ile Met Met Gln Lys Ser Asn Phe Lys Gly Pro Arg Arg
370 375 380

Asn Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Ile Ala Lys Asn
385 390 395 400

Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly
405 410 415

His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn Phe Leu Gly Lys
420 425 430

Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gln Asn Arg
435 440 445

Ser Glu Pro Ala Ala Pro Thr Val Pro Thr Ala Pro Pro Ala Glu Ser
450 455 460

Phe Arg Phe Glu Glu Thr Thr Pro Ala Pro Lys Gln Glu Pro Lys Asp
465 470 475 480

Arg Glu Pro Tyr Arg Glu Pro Leu Thr Ala Leu Arg Ser Leu Phe Gly
485 490 495

Ser Gly Pro Leu Ser Gln
500



<210> 23 
<211> 849 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 23 

Met Arg Val Met Gly Ile Leu Lys Asn Tyr Gln Gln Trp Trp Met Trp
1 5 10 15

Gly Ile Leu Gly Phe Trp Met Leu Ile Ile Ser Ser Val Val Gly Asn
20 25 30

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys
35 40 45

Thr Thr Leu Phe Cys Thr Ser Asp Ala Lys Ala Tyr Glu Thr Glu Val
50 55 60

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65 70 75 80

Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
85 90 95

Asn Asp Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
100 105 110

Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
115 120 125

Lys Cys Arg Asn Val Asn Ala Thr Asn Asn Ile Asn Ser Met Ile Asp
130 135 140

Asn Ser Asn Lys Gly Glu Met Lys Asn Cys Ser Phe Asn Val Thr Thr
145 150 155 160

Glu Leu Arg Asp Arg Lys Gln Glu Val His Ala Leu Phe Tyr Arg Leu
165 170 175

Asp Val Val Pro Leu Gln Gly Asn Asn Ser Asn Glu Tyr Arg Leu Ile
180 185 190

Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
195 200 205

Asp Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Tyr Ala Ile Leu
210 215 220

Lys Cys Asn Asn Gln Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val
225 230 235 240

Ser Ser Val Gln Cys Ala His Gly Ile Lys Pro Val Val Ser Thr Gln
245 250 255

Leu Leu Leu Asn Gly Ser Leu Ala Lys Gly Glu Ile Ile Ile Arg Ser
260 265 270

Glu Asn Leu Ala Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Lys
275 280 285

Pro Val Lys Ile Val Cys Val Arg Pro Asn Asn Asn Thr Arg Lys Ser
290 295 300

Val Arg Ile Gly Pro Gly Gln Thr Phe Tyr Ala Thr Gly Glu Ile Ile
305 310 315 320

Gly Asp Ile Arg Gln Ala Tyr Cys Ile Ile Asn Lys Thr Glu Trp Asn
325 330 335

Ser Thr Leu Gln Gly Val Ser Lys Lys Leu Glu Glu His Phe Ser Lys
340 345 350

Lys Ala Ile Lys Phe Glu Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr
355 360 365

Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asp Thr Ser
370 375 380

Gln Leu Phe Asn Ser Thr Tyr Ser Pro Ser Phe Asn Gly Thr Glu Asn
385 390 395 400

Lys Leu Asn Gly Thr Ile Thr Ile Thr Cys Arg Ile Lys Gln Ile Ile
405 410 415

Asn Met Trp Gln Lys Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Ala
420 425 430

Gly Asn Leu Thr Cys Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
435 440 445

Asp Gly Gly Lys Thr Gly Pro Asn Asp Thr Glu Ile Phe Arg Pro Gly
450 455 460

Gly Gly Asp Met Arg Asp Asn Trp Arg Asn Glu Leu Tyr Lys Tyr Lys
465 470 475 480

Val Val Glu Ile Lys Pro Leu Gly Val Ala Pro Thr Glu Ala Lys Arg
485 490 495
Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe
500 505 510

Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile
515 520 525

Thr Leu Thr Val Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln
530 535 540

Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln
545 550 555 560

Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Thr Arg Ile Leu Ala Val
565 570 575

Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser
580 585 590



Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ser Ser Trp Ser
595 600 605

Asn Arg Ser His Asp Glu Ile Trp Asp Asn Met Thr Trp Met Gln Trp
610 615 620

Asp Arg Glu Ile Asn Asn Tyr Thr Asp Thr Ile Tyr Arg Leu Leu Glu
625 630 635 640

Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Lys Asp Leu Leu Ala Leu
645 650 655

Asp Ser Trp Gln Asn Leu Trp Asn Trp Phe Ser Ile Thr Asn Trp Leu
660 665 670

Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu
675 680 685

Arg Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly
690 695 700

Tyr Ser Pro Leu Pro Phe Gln Thr Leu Thr Pro Asn Pro Arg Glu Pro
705 710 715 720



Asp Arg Leu Gly Arg Ile Glu Glu Glu Gly Gly Glu Gln Asp Arg Gly
725 730 735

Arg Ser Ile Arg Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp
740 745 750

Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ile
755 760 765

Leu Ile Ala Ala Arg Val Leu Glu Leu Leu Gly Gln Arg Gly Trp Glu
770 775 780

Ala Leu Lys Tyr Leu Gly Ser Leu Val Gln Tyr Trp Gly Leu Glu Leu
785 790 795 800

Lys Lys Ser Ala Ile Ser Leu Leu Asp Thr Ile Ala Ile Ala Val Ala
805 810 815

Glu Gly Thr Asp Arg Ile Ile Glu Phe Ile Gln Arg Ile Cys Arg Ala
820 825 830

Ile Arg Asn Ile Pro Arg Arg Ile Arg Gln Gly Phe Glu Ala Ala Leu
835 840 845

Gln



<210> 24 
<211> 855 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 24 

Met Arg Val Arg Gly Ile Leu Arg Ser Trp Gln Gln Trp Trp Ile Trp
1 5 10 15

Gly Ile Leu Gly Phe Trp Ile Cys Ser Gly Leu Gly Asn Leu Trp Val
20 25 30

Thr Val Tyr Asp Gly Val Pro Val Trp Arg Glu Ala Ser Thr Thr Leu
35 40 45

Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val
50 55 60

Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile
65 70 75 80

Glu Leu Asp Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met
85 90 95

Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
100 105 110

Lys Pro Arg Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys Thr
115 120 125

Asn Tyr Ser Thr Asn Tyr Ser Asn Thr Met Asn Ala Thr Ser Tyr Asn
130 135 140

Asn Asn Thr Thr Glu Glu Ile Lys Asn Cys Thr Phe Asn Met Thr Thr
145 150 155 160

Glu Leu Arg Asp Lys Lys Gln Gln Val Tyr Ala Leu Phe Tyr Lys Leu
165 170 175

Asp Ile Val Pro Leu Asn Ser Asn Ser Ser Glu Tyr Arg Leu Ile Asn
180 185 190

Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp
195 200 205
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys
210 215 220

Cys Lys Asn Asn Thr Ser Asn Gly Thr Gly Pro Cys Gln Asn Val Ser
225 230 235 240

Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Pro Leu
245 250 255

Leu Leu Asn Gly Ser Leu Ala Glu Gly Gly Glu Ile Ile Ile Arg Ser
260 265 270

Lys Asn Leu Ser Asn Asn Ala Tyr Thr Ile Ile Val His Leu Asn Asp
275 280 285

Ser Val Glu Ile Val Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly
290 295 300

Ile Arg Ile Gly Pro Gly Gln Thr Phe Tyr Ala Thr Glu Asn Ile Ile
305 310 315 320

Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Ala Gly Glu Trp Asn
325 330 335

Lys Ala Val Gln Arg Val Ser Ala Lys Leu Arg Glu His Phe Pro Asn
340 345 350

Lys Thr Ile Glu Phe Gln Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr
355 360 365

Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser
370 375 380

Lys Leu Phe Asn Ser Ser Tyr Asn Gly Thr Ser Tyr Arg Gly Thr Glu
385 390 395 400

Ser Asn Ser Ser Ile Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile
405 410 415

Asp Met Trp Gln Lys Val Gly Arg Ala Ile Tyr Ala Pro Pro Ile Glu
420 425 430

Gly Asn Ile Thr Cys Ser Ser Ser Ile Thr Gly Leu Leu Leu Ala Arg
435 440 445

Asp Gly Gly Leu Asp Asn Ile Thr Thr Glu Ile Phe Arg Pro Gln Gly
450 455 460

Gly Asp Met Lys Asp Asn Trp Arg Asn Glu Leu Tyr Lys Tyr Lys Val
465 470 475 480

Val Glu Ile Lys Pro Leu Gly Val Ala Pro Thr Glu Ala Lys Arg Arg
485 490 495

Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Ile Phe
500 505 510
Gly Phe Leu Gly Ala Ala Gly Ser Asn Met Gly Ala Ala Ser Ile Thr
515 520 525

Leu Thr Ala Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln
530 535 540

Ser Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Met Leu Gln Leu
545 550 555 560

Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Ile Glu
565 570 575

Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly
580 585 590

Lys Leu Ile Cys Thr Thr Thr Val Pro Trp Asn Ser Ser Trp Ser Asn
595 600 605

Lys Thr Gln Gly Glu Ile Trp Glu Asn Met Thr Trp Met Gln Trp Asp
610 615 620

Lys Glu Ile Ser Asn Tyr Thr Gly Ile Ile Tyr Arg Leu Leu Glu Glu
625 630 635 640

Ser Gln Asn Gln Gln Glu Gln Asn Glu Lys Asp Leu Leu Ala Leu Asp
645 650 655

Ser Arg Asn Asn Leu Trp Ser Trp Phe Asn Ile Ser Asn Trp Leu Trp
660 665 670

Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg
675 680 685

Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr
690 695 700

Ser Pro Leu Ser Phe Gln Thr Leu Thr Pro Asn Pro Arg Gly Leu Asp
705 710 715 720

Arg Leu Gly Arg Ile Glu Glu Glu Gly Gly Glu Gln Asp Arg Asp Arg
725 730 735

Ser Ile Arg Leu Val Gln Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu
740 745 750

Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Ile Leu
755 760 765

Val Thr Ala Arg Val Val Glu Leu Leu Gly Arg Ser Ser Pro Arg Gly
770 775 780

Leu Gln Arg Gly Trp Glu Ala Leu Lys Tyr Leu Gly Ser Leu Val Gln
785 790 795 800

Tyr Trp Gly Leu Glu Leu Lys Lys Ser Ala Thr Ser Leu Leu Asp Ser
805 810 815
Ile Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Ile Ile Glu Val Ile
820 825 830

Gln Arg Ile Tyr Arg Ala Phe Cys Asn Ile Pro Arg Arg Val Arg Gln
835 840 845

Gly Phe Glu Ala Ala Leu Gln
850 855



<210> 25 
<211> 20 
<212> PRT 

<213> Human immunodeficiency virus
<400> 25

Asp Ile Lys Gln Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg
1 5 10 15

Phe Phe Lys Thr
20



<210> 26 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 26 

gacataaaac aaggaccaaa agagcccttt agagactatg tagaccggtt ctttaaaacc 60 

<210> 27 
<211> 20 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 27 

Asp Ile Arg Gln Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg
1 5 10 15 

Phe Phe Lys Thr 
20 



<210> 28 
<211> 47 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 28 

Thr Ile Thr Ile Thr Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln
1 5 10 15

Lys Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Ala Gly Asn Leu Thr
20 25 30

Cys Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
35 40 45



<210> 29 
<211> 48 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 29 

Ser Ile Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asp Met Trp
1 5 10 15

Gln Lys Val Gly Arg Ala Ile Tyr Ala Pro Pro Ile Glu Gly Asn Ile
20 25 30

Thr Cys Ser Ser Ser Ile Thr Gly Leu Leu Leu Ala Arg Asp Gly Gly
35 40 45



<210> 30 
<211> 2469 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PR975(+) 
<400> 30 

gtcgacgcca ccatggccga ggccatgagc caggccacca gcgccaacat cctgatgcag 60 
cgcagcaact tcaagggccc caagcgcatc atcaagtgct tcaactgcgg caaggagggc 120 
cacatcgccc gcaactgccg cgccccccgc aagaagggct gctggaagtg cggcaaggag 180 
ggccaccaga tgaaggactg caccgagcgc caggccaact tcttccgcga ggacctggcc 240 
ttcccccagg gcaaggcccg cgagttcccc agcgagcaga accgcgccaa cagccccacc 300 
agccgcgagc tgcaggtgcg cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 360 
ggcaccctga acttccccca gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 420 
ggcggccaga tcaaggaggc cctgctggac accggcgccg acgacaccgt gctggaggag 480 
atgagcctgc ccggcaagtg gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540 
gtgcgccagt acgaccagat cctgatcgag atctgcggca agaaggccat cggcaccgtg 600 
ctgatcggcc ccacccccgt gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660 
accctgaact tccccatcag ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720 
gacggcccca aggtgaagca gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780 
atctgcgagg agatggagaa ggagggcaag atcaccaaga tcggccccga gaacccctac 840 
aacacccccg tgttcgccat caagaagaag gacagcacca agtggcgcaa gctggtggac 900 
ttccgcgagc tgaacaagcg cacccaggac ttctgggagg tgcagctggg catcccccac 960 
cccgccggcc tgaagaagaa gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020 
agcgtgcccc tggacgagga cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080 
aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140 
cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200 
gagatcgtga tctaccagta catggacgac ctgtacgtgg gcagcgacct ggagatcggc 1260 
cagcaccgcg ccaagatcga ggagctgcgc aagcacctgc tgcgctgggg cttcaccacc 1320 
cccgacaaga agcaccagaa ggagcccccc ttcctgtgga tgggctacga gctgcacccc 1380 
gacaagtgga ccgtgcagcc catcgagctg cccgagaagg agagctggac cgtgaacgac 1440 
atccagaagc tggtgggcaa gctgaactgg gccagccaga tctaccccgg catcaaggtg 1500
cgccagctgt gcaagctgct gcgcggcgcc aaggccctga ccgacatcgt gcccctgacc 1560 
gaggaggccg agctggagct ggccgagaac cgcgagatcc tgcgcgagcc cgtgcacggc 1620 
gtgtactacg accccagcaa ggacctggtg gccgagatcc agaagcaggg ccacgaccag 1680 
tggacctacc agatctacca ggagcccttc aagaacctga agaccggcaa gtacgccaag 1740 
atgcgcaccg cccacaccaa cgacgtgaag cagctgaccg aggccgtgca gaagatcgcc 1800
atggagagca tcgtgatctg gggcaagacc cccaagttcc gcctgcccat ccagaaggag 1860
acctgggaga cctggtggac cgactactgg caggccacct ggatccccga gtgggagttc 1920
gtgaacaccc cccccctggt gaagctgtgg taccagctgg agaaggagcc catcatcggc 1980
gccgagacct tctacgtgga cggcgccgcc aaccgcgaga ccaagatcgg caaggccggc 2040
tacgtgaccg accggggccg gcagaagatc gtgagcctga ccgagaccac caaccagaag 2100
accgagctgc aggccatcca gctggccctg caggacagcg gcagcgaggt gaacatcgtg 2160
accgacagcc agtacgccct gggcatcatc caggcccagc ccgacaagag cgagagcgag 2220
ctggtgaacc agatcatcga gcagctgatc aagaaggaga aggtgtacct gagctgggtg 2280
cccgcccaca agggcatcgg cggcaacgag cagatcgaca agctggtgag caagggcatc 2340
cgcaaggtgc tgttcctgga cggcatcgat ggcggcatcg tgatctacca gtacatggac 2400
gacctgtacg tgggcagcgg cggccctagg atcgattaaa agcttcccgg ggctagcacc 2460
ggtgaattc 2469

<210> 31
<211> 2463
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: PR975YM
<400> 31

gtcgacgcca ccatggccga ggccatgagc caggccacca gcgccaacat cctgatgcag 60 
cgcagcaact tcaagggccc caagcgcatc atcaagtgct tcaactgcgg caaggagggc 120 
cacatcgccc gcaactgccg cgccccccgc aagaagggct gctggaagtg cggcaaggag 180 
ggccaccaga tgaaggactg caccgagcgc caggccaact tcttccgcga ggacctggcc 240 
ttcccccagg gcaaggcccg cgagttcccc agcgagcaga accgcgccaa cagccccacc 300 
agccgcgagc tgcaggtgcg cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 360 
ggcaccctga acttccccca gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 420 
ggcggccaga tcaaggaggc cctgctggac accggcgccg acgacaccgt gctggaggag 480 
atgagcctgc ccggcaagtg gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540 
gtgcgccagt acgaccagat cctgatcgag atctgcggca agaaggccat cggcaccgtg 600 
ctgatcggcc ccacccccgt gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660 
accctgaact tccccatcag ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720 
gacggcccca aggtgaagca gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780 
atctgcgagg agatggagaa ggagggcaag atcaccaaga tcggccccga gaacccctac 840 
aacacccccg tgttcgccat caagaagaag gacagcacca agtggcgcaa gctggtggac 900 
ttccgcgagc tgaacaagcg cacccaggac ttctgggagg tgcagctggg catcccccac 960 
cccgccggcc tgaagaagaa gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020 
agcgtgcccc tggacgagga cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080 
aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140 
cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200 
gagatcgtga tctaccaggc ccccctgtac gtgggcagcg acctggagat cggccagcac 1260 
cgcgccaaga tcgaggagct gcgcaagcac ctgctgcgct ggggcttcac cacccccgac 1320 
aagaagcacc agaaggagcc ccccttcctg tggatgggct acgagctgca ccccgacaag 1380 
tggaccgtgc agcccatcga gctgcccgag aaggagagct ggaccgtgaa cgacatccag 1440 
aagctggtgg gcaagctgaa ctgggccagc cagatctacc ccggcatcaa ggtgcgccag 1500 
ctgtgcaagc tgctgcgcgg cgccaaggcc ctgaccgaca tcgtgcccct gaccgaggag 1560 
gccgagctgg agctggccga gaaccgcgag atcctgcgcg agcccgtgca cggcgtgtac 1620 
tacgacccca gcaaggacct ggtggccgag atccagaagc agggccacga ccagtggacc 1680 
taccagatct accaggagcc cttcaagaac ctgaagaccg gcaagtacgc caagatgcgc 1740 
accgcccaca ccaacgacgt gaagcagctg accgaggccg tgcagaagat cgccatggag 1800
agcatcgtga tctggggcaa gacccccaag ttccgcctgc ccatccagaa ggagacctgg 1860
gagacctggt ggaccgacta ctggcaggcc acctggatcc ccgagtggga gttcgtgaac 1920
accccccccc tggtgaagct gtggtaccag ctggagaagg agcccatcat cggcgccgag 1980
accttctacg tggacggcgc cgccaaccgc gagaccaaga tcggcaaggc cggctacgtg 2040
accgaccggg gccggcagaa gatcgtgagc ctgaccgaga ccaccaacca gaagaccgag 2100
ctgcaggcca tccagctggc cctgcaggac agcggcagcg aggtgaacat cgtgaccgac 2160 
agccagtacg ccctgggcat catccaggcc cagcccgaca agagcgagag cgagctggtg 2220 
aaccagatca tcgagcagct gatcaagaag gagaaggtgt acctgagctg ggtgcccgcc 2280 
cacaagggca tcggcggcaa cgagcagatc gacaagctgg tgagcaaggg catccgcaag 2340 
gtgctgttcc tggacggcat cgatggcggc atcgtgatct accagtacat ggacgacctg 2400 
tacgtgggca gcggcggccc taggatcgat taaaagcttc ccggggctag caccggtgaa 2460 
ttc 2463 

<210> 32 
<211> 2457 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PR975YMWM 
<400> 32 

gtcgacgcca ccatggccga ggccatgagc caggccacca gcgccaacat cctgatgcag 60 
cgcagcaact tcaagggccc caagcgcatc atcaagtgct tcaactgcgg caaggagggc 120 
cacatcgccc gcaactgccg cgccccccgc aagaagggct gctggaagtg cggcaaggag 180 
ggccaccaga tgaaggactg caccgagcgc caggccaact tcttccgcga ggacctggcc 240 
ttcccccagg gcaaggcccg cgagttcccc agcgagcaga accgcgccaa cagccccacc 300 
agccgcgagc tgcaggtgcg cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 360 
ggcaccctga acttccccca gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 420 
ggcggccaga tcaaggaggc cctgctggac accggcgccg acgacaccgt gctggaggag 480 
atgagcctgc ccggcaagtg gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540 
gtgcgccagt acgaccagat cctgatcgag atctgcggca agaaggccat cggcaccgtg 600 
ctgatcggcc ccacccccgt gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660 
accctgaact tccccatcag ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720 
gacggcccca aggtgaagca gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780 
atctgcgagg agatggagaa ggagggcaag atcaccaaga tcggccccga gaacccctac 840 
aacacccccg tgttcgccat caagaagaag gacagcacca agtggcgcaa gctggtggac 900 
ttccgcgagc tgaacaagcg cacccaggac ttctgggagg tgcagctggg catcccccac 960 
cccgccggcc tgaagaagaa gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020 
agcgtgcccc tggacgagga cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080 
aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140 
cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200 
gagatcgtga tctaccaggc ccccctgtac gtgggcagcg acctggagat cggccagcac 1260 
cgcgccaaga tcgaggagct gcgcaagcac ctgctgcgct ggggcttcac cacccccgac 1320 
aagaagcacc agaaggagcc ccccttcctg cccatcgagc tgcaccccga caagtggacc 1380 
gtgcagccca tcgagctgcc cgagaaggag agctggaccg tgaacgacat ccagaagctg 1440 
gtgggcaagc tgaactgggc cagccagatc taccccggca tcaaggtgcg ccagctgtgc 1500 
aagctgctgc gcggcgccaa ggccctgacc gacatcgtgc ccctgaccga ggaggccgag 1560 
ctggagctgg ccgagaaccg cgagatcctg cgcgagcccg tgcacggcgt gtactacgac 1620 
cccagcaagg acctggtggc cgagatccag aagcagggcc acgaccagtg gacctaccag 1680 
atctaccagg agcccttcaa gaacctgaag accggcaagt acgccaagat gcgcaccgcc 1740 
cacaccaacg acgtgaagca gctgaccgag gccgtgcaga agatcgccat ggagagcatc 1800 
gtgatctggg gcaagacccc caagttccgc ctgcccatcc agaaggagac ctgggagacc 1860 
tggtggaccg actactggca ggccacctgg atccccgagt gggagttcgt gaacaccccc 1920 
cccctggtga agctgtggta ccagctggag aaggagccca tcatcggcgc cgagaccttc 1980 
tacgtggacg gcgccgccaa ccgcgagacc aagatcggca aggccggcta cgtgaccgac 2040 
cggggccggc agaagatcgt gagcctgacc gagaccacca accagaagac cgagctgcag 2100 
gccatccagc tggccctgca ggacagcggc agcgaggtga acatcgtgac cgacagccag 2160 
tacgccctgg gcatcatcca ggcccagccc gacaagagcg agagcgagct ggtgaaccag 2220 
atcatcgagc agctgatcaa gaaggagaag gtgtacctga gctgggtgcc cgcccacaag 2280 
ggcatcggcg gcaacgagca gatcgacaag ctggtgagca agggcatccg caaggtgctg 2340 
ttcctggacg gcatcgatgg cggcatcgtg atctaccagt acatggacga cctgtacgtg 2400
ggcagcggcg gccctaggat cgattaaaag cttcccgggg ctagcaccgg tgaattc 2457 

<210> 33 
<211> 9781 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 33 

tggaagggtt aatttactcc aagaaaaggc aagaaatcct tgatttgtgg gtctatcaca 60 
cacaaggctt cttccctgat tggcaaaact acacaccggg gccaggggtc agatatccac 120 
tgacctttgg atggtgctac aagctagtgc cagttgaccc aggggaggtg gaagaggcca 180 
acggaggaga agacaactgt ttgctacacc ctatgagcca acatggagca gaggatgaag 240 
atagagaagt attaaagtgg aagtttgaca gcctcctagc acgcagacac atggcccgcg 300 
agctacatcc ggagtattac aaagactgct gacacagaag ggactttccg cctgggactt 360 
tccactgggg cgttccggga ggtgtggtct gggcgggact tgggagtggt caaccctcag 420 
atgctgcata taagcagctg cttttcgcct gtactgggtc tctctcggta gaccagatct 480 
gagcctggga gccctctggc tatctaggga acccactgct taagcctcaa taaagcttgc 540 
cttgagtgct ttaagtagtg tgtgcccatc tgttgtgtga ctctggtaac tagagatccc 600 
tcagaccctt tgtggtagtg tggaaaatct ctagcagtgg cgcccgaaca gggaccagaa 660 
agtgaaagtg agaccagagg agatctctcg acgcaggact cggcttgctg aagtgcacac 720 
ggcaagaggc gagaggggcg gctggtgagt acgccaattt tacttgacta gcggaggcta 780 
gaaggagaga gatgggtgcg agagcgtcaa tattaagcgg cggaaaatta gataaatggg 840 
aaagaattag gttaaggcca gggggaaaga aacattatat gttaaaacat ctagtatggg 900 
caagcaggga gcCggaaaga tttgcactta accctggcct gttagaaaca tcagaaggct 960 
gtaaacaaat aataaaacag ctacaaccag ctcttcagac aggaacagag gaacttagat 1020 
cattattcaa cacagtagca actctctatt gtgtacataa agggatagag gtacgagaca 1080 
ccaaggaagc cttagacaag atagaggaag aacaaaacaa atgtcagcaa aaagcacaac 1140 
aggcaaaagc agctgacgaa aaggtcagtc aaaattatcc tatagtacag aatgcccaag 1200 
ggcaaatggt acaccaagct atatcaccta gaacattgaa tgcatggata aaagtaatag 1260 
aggaaaaggc tttcaatcca gaggaaatac ccatgtttac agcattatca gaaggagcca 1320 
ccccacaaga tttaaacaca atgttaaata cagtgggggg acatcaagca gccatgcaaa 1380 
tgttaaaaga taccatcaat gaggaggctg cagaatggga taggacacat ccagtacatg 1440 
cagggcctgt tgcaccaggc cagatgagag aaccaagggg aagtgacata gcaggaacta 1500 
ctagtaccct tcaggaacaa atagcatgga tgacaagtaa tccacctatt ccagtagaag 1560 
acatctataa aagatggata attctggggt taaataaaat agtaagaatg tatagccctg 1620 
ttagcatttt ggacataaaa caagggccaa aagaaccctt tagagactat gtagaccggt 1680 
tctttaaaac cttaagagct gaacaagcta cacaagatgt aaagaattgg atgacagaca 1740 
ccttgttggt ccaaaatgcg aacccagatt gtaagaccat ttcaagagca ttaggaccag 1800 
gggcctcatt agaagaaatg atgacagcat gtcagggagt gggaggacct agccataaag 1860 
caagagtgtt ggctgaggca atgagccaag caaacagtaa catactagtg cagagaagca 1920 
attttaaagg ctctaacaga attattaaat gtttcaactg tggcaaagta gggcacatag 1980 
ccagaaattg cagggcccct aggaaaaagg gctgttggaa atgtggacag gaaggacacc 2040 
aaatgaaaga ctgtactgag aggcaggcta attttttagg gaaaatttgg ccttcccaca 2100 
aggggaggcc agggaatttc ctccagaaca gaccagagcc aacagcccca ccagcagaac 2160 
caacagcccc accagcagag agcttcaggt tcgaggagac aacccccgtg ccgaggaagg 2220 
agaaagagag ggaaccttta acttccctca aatcactctt tggcagcgac cccttgtctc 2280 
aataaaagta gagggccaga taaaggaggc tctcttagac acaggagcag atgatacagt 2340 
attagaagaa atagatttgc cagggaaatg gaaaccaaaa atgatagggg gaattggagg 2400 
ttttatcaaa gtaagacagt atgatcaaat acttatagaa atttgtggaa aaaaggctat 2460 
aggtacagta ttagtagggc ctacaccagt caacataatt ggaagaaatc tgttaactca 2520 
gcttggatgc acactaaatt ttccaattag tcctattgaa actgtaccag taaaattaaa 2580 
accaggaatg gatggcccaa aggtcaaaca atggccattg acagaagaaa aaataaaagc 2640 
attaacagca atttgtgagg aaatggagaa ggaaggaaaa attacaaaaa ttgggcctga 2700 
taatccatat aacactccag tatttgccat aaaaaagaag gacagtacta agtggagaaa 2760 
attagtagat ttcagggaac tcaataaaag aactcaagac ttttgggaag ttcaattagg 2820 
aataccacac ccagcaggat taaaaaagaa aaaatcagtg acagtgctag atgtggggga 2880 
tgcatatttt tcagttcctt tagatgaaag cttcaggaaa tatactgcat tcaccatacc 2940 
tagtataaac aatgaaacac cagggattag atatcaatat aatgtgctgc cacagggatg 3000 
gaaaggatca ccagcaatat tccagagtag catgacaaaa atcttagagc ccttcagagc 3060 
aaaaaatcca gacatagtta tctatcaata tatggatgac ttgtatgtag gatctgactt 3120 
agaaataggg caacatagag caaaaataga agagttaagg gaacatttat tgaaatgggg 3180 
atttacaaca ccagacaaga aacatcaaaa agaaccccca tttctttgga tggggtatga 3240 
actccatcct gacaaatgga cagtacaacc tatactgctg ccagaaaagg atagttggac 3300 
tgtcaatgat atacagaagt tagtgggaaa attaaactgg gcaagtcaga tttacccagg 3360 
gattaaagta aggcaactct gtaaactcct caggggggcc aaagcactaa cagacatagt 3420 
accactaact gaagaagcag aattagaatt ggcagagaac agggaaattt taagagaacc 3480 
agtacatgga gtatattatg atccatcaaa agacttgata gctgaaatac agaaacaggg 3540 
gcatgaacaa tggacatatc aaatttatca agaaccattt aaaaatctga aaacagggaa 3600 
gtatgcaaaa atgaggacta cccacactaa tgatgtaaaa cagttaacag aggcagtgca 3660 
aaaaatagcc atggaaagca tagtaatatg gggaaagact cctaaattta gactacccat 3720 
ccaaaaagaa acatgggaga catggtggac agactattgg caagccacct ggatccctga 3780 
gtgggagttt gttaataccc ctcccctagt aaaattatgg taccaactag aaaaagatcc 3840 
catagcagga gtagaaactt tctatgtaga tggagcaact aatagggaag ctaaaatagg 3900 
aaaagcaggg tatgttactg acagaggaag gcagaaaatt gttactctaa ctaacacaac 3960 
aaatcagaag actgagttac aagcaattca gctagctctg caggattcag gatcagaagt 4020 
aaacatagta acagactcac agtatgcatt aggaatcatt caagcacaac cagataagag 4080 
tgactcagag atatttaacc aaataataga acagttaata aacaaggaaa gaatctacct 4140 
gtcatgggta ccagcacata aaggaattgg gggaaatgaa caagtagata aattagtaag 4200 
taagggaatt aggaaagtgt tgtttctaga tggaatagat aaagctcaag aagagcatga 4260 
aaggtaccac agcaattgga gagcaatggc taatgagttt aatctgccac ccatagtagc 4320 
aaaagaaata gtagctagct gtgataaatg tcagctaaaa ggggaagcca tacatggaca 4380 
agtcgactgt agtccaggga tatggcaatt agattgtacc catttagagg gaaaaatcat 4440 
cctggtagca gtccatgtag ctagtggcta catggaagca gaggttatcc cagcagaaac 4500 
aggacaagaa acagcatatt ttatattaaa attagcagga agatggccag tcaaagtaat 4560 
acatacagac aatggcagta attttaccag tactgcagtt aaggcagcct gttggtgggc 4620 
aggtatccaa caggaatttg gaattcccta caatccccaa agtcagggag tggtagaatc 4680 
catgaataaa gaattaaaga aaataatagg acaagtaaga gatcaagctg agcaccttaa 4740 
gacagcagta caaatggcag tattcattca caattttaaa agaaaagggg gaattggggg 4800 
gtacagtgca ggggaaagaa taatagacat aatagcaaca gacatacaaa ctaaagaatt 4860 
acaaaaacaa attataagaa ttcaaaattt tcgggtttat tacagagaca gcagagaccc 4920 
tatttggaaa ggaccagccg aactactctg gaaaggtgaa ggggtagtag taatagaaga 4980 
taaaggtgac ataaaggtag taccaaggag gaaagcaaaa atcattagag attatggaaa 5040 
acagatggca ggtgctgatt gtgtggcagg tggacaggat gaagattaga gcatggaata 5100 
gtttagtaaa gcaccatatg tatatatcaa ggagagctag tggatgggtc tacagacatc 5160 
attttgaaag cagacatcca aaagtaagtt cagaagtaca tatcccatta ggggatgcta 5220 
gattagtaat aaaaacatat tggggtttgc agacaggaga aagagattgg catttgggtc 5280 
atggagtctc catagaatgg agactgagag aatacagcac acaagtagac cctgacctgg 5340 
cagaccagct aattcacatg cattattttg attgttttac agaatctgcc ataagacaag 5400 
ccatattagg acacatagtt tttcctaggt gtgactatca agcaggacat aagaaggtag 5460 
gatctctgca atacttggca ctgacagcat tgataaaacc aaaaaagaga aagccacctc 5520 
tgcctagtgt tagaaaatta gtagaggata gatggaacga cccccagaag accaggggcc 5580 
gcagagggaa ccatacaatg aatggacact agagattcta gaagaactca agcaggaagc 5640 
tgtcagacac tttcctagac catggctcca tagcttagga caatatatct atgaaaccta 5700 
tggggatact tggacgggag ttgaagctat aataagagta ctgcaacaac tactgttcat 5760 
tcatttcaga attggatgcc aacacagcag aataggcatc ttgcgacaga gaagagcaag 5820 
aaatggagcc agtagatcct aaactaaagc cctggaacca tccaggaagc caacctaaaa 5880 
cagcttgtaa taattgcttt tgcaaacact gtagctatca ttgtctagtt tgctttcaga 5940 
caaaaggttt aggcatttcc tatggcagga agaagcggag acagcgacga agcgctcctc 6000 
caagtggtga agatcatcaa aatcctctat caaagcagta agtacacata gtagatgtaa 6060 
tggtaagttt aagtttattt aaaggagtag attatagatt aggagtagga gcattgatag 6120 
tagcactaat catagcaata atagtgtgga ccatagcata tatagaatat aggaaattgg 6180 
taagacaaaa gaaaatagac tggttaatta aaagaattag ggaaagagca gaagacagtg 6240 
gcaatgagag tgatggggac acagaagaat tgtcaacaat ggtggatatg gggcatctta 6300 
ggcttctgga tgctaatgat ttgtaacacg gaggacttgt gggtcacagt ctactatggg 6360 
gtacctgtgt ggagagaagc aaaaactact ctattctgtg catcagatgc taaagcatat 6420 
gagacagaag tgcataatgt ctgggctaca catgcttgtg tacccacaga ccccaaccca 6480 
caagaaatag ttttgggaaa tgtaacagaa aattttaata tgtggaaaaa taacatggca 6540 
gatcagatgc atgaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaag 6600 
ttgaccccac tctgtgtcac tttaaactgt acagatacaa atgttacagg taatagaact 6660 
gttacaggta atacaaatga taccaatatt gcaaatgcta catataagta tgaagaaatg 6720 
aaaaattgct ctttcaatgc aaccacagaa ttaagagata agaaacataa agagtatgca 6780 
ctcttttata aacttgatat agtaccactt aatgaaaata gtaacaactt tacatataga 6840 
ttaataaatt gcaatacctc aaccataaca caagcctgtc caaaggtctc ttttgacccg 6900 
attcctatac attactgtgc tccagctgat tatgcgattc taaagtgtaa taataagaca 6960 
ttcaatggga caggaccatg ttataatgtc agcacagtac aatgtacaca tggaattaag 7020 
ccagtggtat caactcaact actgttaaat ggtagtctag cagaagaagg gataataatt 7080 
agatctgaaa atttgacaga gaataccaaa acaataatag tacatcttaa tgaatctgta 7140 
gagattaatt gtacaaggcc caacaataat acaaggaaaa gtgtaaggat aggaccagga 7200 
caagcattct atgcaacaaa tgacgtaata ggaaacataa gacaagcaca ttgtaacatt 7260 
agtacagata gatggaataa aactttacaa caggtaatga aaaaattagg agagcatttc 7320 
cctaataaaa caataaaatt tgaaccacat gcaggagggg atctagaaat tacaatgcat 7380 
agctttaatt gtagaggaga atttttctat tgcaatacat caaacctgtt taatagtaca 7440 
tactacccta agaatggtac atacaaatac aatggtaatt caagcttacc catcacactc 7500 
caatgcaaaa taaaacaaat tgtacgcatg tggcaagggg taggacaagc aatgtatgcc 7560 
cctcccattg caggaaacat aacatgtaga tcaaacatca caggaatact attgacacgt 7620 
gatgggggat ttaacaacac aaacaacgac acagaggaga cattcagacc tggaggagga 7680 
gatatgaggg ataactggag aagtgaatta tataaatata aagtggtaga aattaagcca 7740 
ttgggaatag cacccactaa ggcaaaaaga agagtggtgc agagaaaaaa aagagcagtg 7800 
ggaataggag ctgtgttcct tgggttcttg ggagcagcag gaagcactat gggcgcagcg 7860 
tcaataacgc tgacggtaca ggccagacaa ctgttgtctg gtatagtgca acagcaaagc 7920 
aatttgctga aggctataga ggcgcaacag catatgttgc aactcacagt ctggggcatt 7980 
aagcagctcc aggcgagagt cctggctata gaaagatacc taaaggatca acagctccta 8040 
gggatttggg gctgctctgg aagactcatc tgcaccactg ctgtgccttg gaactccagt 8100 
tggagtaata aatctgaagc agatatttgg gataacatga cttggatgca gtgggataga 8160 
gaaattaata attacacaga aacaatattc aggttgcttg aagactcgca aaaccagcag 8220 
gaaaagaatg aaaaagattt attagaattg gacaagtgga ataatctgtg gaattggttt 8280 
gacatatcaa actggctgtg gtatataaaa atattcataa tgatagtagg aggcttgata 8340 
ggtttaagaa taatttttgc tgtgctctct atagtgaata gagttaggca gggatactca 8400 
cctttgtcat ttcagaccct taccccaagc ccgaggggac tcgacaggct cggaggaatc 8460 
gaagaagaag gtggagagca agacagagac agatccatac gattggtgag cggattcttg 8520 
tcgcttgcct gggacgatct gcggagcctg tgcctcttca gccaccaccg cttgagagac 8580 
ttcatattaa ttgcagtgag ggcagtggaa cttctgggac acagcagtct caggggacta 8640 
cagagggggt gggagatcct taagtatctg ggaagtcttg tgcagtattg gggtctagag 8700 
ctaaaaaaga gtgctattag tccgcttgat accatagcaa tagcagtagc tgaaggaaca 8760 
gataggatta tagaattggt acaaagaatt tgtagagcta tcctcaacat acctaggaga 8820 
ataagacagg gctttgaagc agctttgcta taaaatggga ggcaagtggt caaaacgcag 8880 
catagttgga tggcctgcag taagagaaag aatgagaaga actgagccag cagcagaggg 8940 
agtaggagca gcgtctcaag acttagatag acatggggca cttacaagca gcaacacacc 9000 
tgctactaat gaagcttgtg cctggctgca agcacaagag gaggacggag atgtaggctt 9060 
tccagtcaga cctcaggtac ctttaagacc aatgacttat aagagtgcag tagatctcag 9120 
cttcttttta aaagaaaagg ggggactgga agggttaatt tactctagga aaaggcaaga 9180 
aatccttgat ttgtgggtct ataacacaca aggcttcttc cctgattggc aaaactacac 9240 
atcggggcca ggggtccgat tcccactgac ctttggatgg tgcttcaagc tagtaccagt 9300 
tgacccaagg gaggtgaaag aggccaatga aggagaagac aactgtttgc tacaccctat 9360 
gagccaacat ggagcagagg atgaagatag agaagtatta aagtggaagt ttgacagcct 9420 
tctagcacac agacacatgg cccgcgagct acatccggag tattacaaag actgctgaca 9480 
cagaagggac tttccgcctg ggactttcca ctggggcgtt ccgggaggtg tggtctgggc 9540 
gggacttggg agtggtcacc ctcagatgct gcatataagc agctgctttt cgcttgtact 9600 
gggtctctct cggtagacca gatctgagcc tgggagctct ctggctatct agggaaccca 9660 
ctgcttaggc ctcaataaag cttgccttga gtgctctaag tagtgtgtgc ccatctgttg 9720 
tgtgactctg gtaactagag atccctcaga ccctttgtgg tagtgtggaa aatctctagc 9780 
a 9781 



<210> 34 
<211> 203 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 34 

gctgaggcaa tgagccaagc aaccagcgca aacatactga tgcagagaag caatttcaaa 60 
ggccctaaaa gaattattaa atgtttcaac tgtggcaagg aagggcacat agctagaaat 120 
tgtagggccc ctaggaaaaa aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa 180 
gactgtactg agaggcaggc taa 203 

<210> 35 
<211> 2151 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 35 

ttttttaggg aagatttggc cttcccacaa gggaaggcca gggaatttcc ttcagaacag 60 
aacagagcca acagccccac cagcagagag cttcaagttc gaggagacaa cccccgctcc 120 
gaagcaggag ccgaaagaca gggaaccctt aatttccctc aaatcactct ttggcagcga 180 
ccccttgtct caataaaagt agggggtcaa ataaaggagg ctctcttaga cacaggagct 240 
gatgatacag tattagaaga aatgagtttg ccaggaaaat ggaaaccaaa aatgatagga 300 
ggaattggag gttttatcaa agtaagacag tatgatcaaa tacttataga aatttgtgga 360 
aaaaaggcta taggtacagt attaatagga cctacacctg tcaacataat tggaaggaat 420 
atgttgactc agcttggatg cacactaaat tttccaatta gtcccattga aactgtgcca 480 
gtaaaattaa agccaggaat ggatggccca aaggttaaac aatggccatt gacagaagag 540 
aaaataaaag cattaacagc aatttgtgaa gaaatggaga aagaaggaaa aattacaaaa 600 
attgggcctg aaaatccata taacactcca gtatttgcca taaaaaagaa ggacagtact 660 
aagtggagaa agttagtaga tttcagggaa cttaataaaa gaactcaaga cttttgggaa 720 
gttcaattag gaataccaca cccagcaggg ttaaaaaaga aaaaatcagt gacagtactg 780 
gatgtggggg atgcatattt ttcagttcct ttagatgagg acttcaggaa atatactgca 840 
ttcaccatac ctagtataaa caatgaaaca ccagggatta gatatcaata taatgtgctt 900 
ccacagggat ggaaaggatc accatcaata ttccagagta gcatgacaaa aatcttagag 960 
ccctttagag caagaaatcc agaaatagtc atctatcaat atatggatga cttgtatgta 1020 
ggatctgact tagaaatagg gcaacataga gcaaaaatag aggagttaag aaaacatctg 1080 
ttaaggtggg gatttaccac accggacaag aaacatcaga aagaaccccc atttctttgg 1140 
atggggtatg aactccatcc tgacaaatgg acagtacagc ctatagagtt gccagaaaag 1200 
gaaagctgga ctgtcaatga tatacagaag ttagtgggaa aattaaattg ggccagtcag 1260 
atttacccag gaattaaagt aaggcaactt tgtaaactcc ttaggggggc caaagcacta 1320 
acagatatag taccactaac tgaagaagca gaattagaat tggcagagaa cagggaaatt 1380 
ctaagagaac cagtacatgg agtatattat gacccatcaa aagacttggt agctgaaata 1440 
cagaaacagg ggcatgacca atggacatat caaatttacc aagaaccatt caaaaacctg 1500 
aaaacaggga agtatgcaaa aatgaggact gcccacacta atgatgtaaa acagttaaca 1560 
gaggcagtgc aaaaaatagc tatggaaagc atagtaatat ggggaaagac tcctaaattt 1620 
agactaccca tccaaaaaga aacatgggag acatggtgga cagactattg gcaagccacc 1680 
tggattcctg agtgggagtt tgttaatacc cctcccttag taaaattatg gtaccagcta 1740 
gagaaagaac ccataatagg agcagaaact ttctatgtag atggagcagc taatagggaa 1800 
actaaaatag gaaaagcagg gtatgttact gacagaggaa ggcagaaaat tgtttctcta 1860 
acagaaacaa caaatcagaa gactgaatta caagcaattc agctagcttt gcaagattca 1920 
ggatcagaag taaacatagt aacagactca cagtatgcat taggaatcat tcaagcacaa 1980 
ccagataaga gtgaatcaga gttagtcaac caaataatag aacaattaat aaaaaaggaa 2040 
aaggtctacc tgtcatgggt accagcacat aaaggaattg gaggaaatga acaaatagat 2100 
aaattagtaa gtaagggaat caggaaagtg ctgtttctag atggaataga t 2151 






<210> 36 
<211> 54 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 36 

ggcggcatcg tgatctacca gtacatggac gacctgtacg tgggcagcgg cggc 54 

<210> 37 
<211> 18 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 37 

Gly Gly Ile Val Ile Tyr Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser 
1 5 10 15 

Gly Gly 



<210> 38 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S1FCSacTA 

<400> 38 

gtttcttgag ctctggaagg gttaatttac tccaagaa 38 

<210> 39 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S1FTSacTA 

<400> 39 

gtttcttgag ctctggaagg gttaatttac tctaagaa 38 

<210> 40 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S145RTSalTA 

<400> 40 

gtttcttgtc gacttgtcca tgtatggctt cccct 35 

<210> 41 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S145RCSalTA 

<400> 41 

gtttcttgtc gacttgtcca tgcatggctt ccct 34 

<210> 42 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S245FASalTA 

<400> 42 

gtttcttgtc gactgtagtc caggaatatg gcaattag 38 

<210> 43 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S245FGSalTA 

<400> 43 

gtttcttgtc gactgtagtc cagggatatg gcaattag 38 

<210> 44 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S2FullNotTA 

<400> 44 

gtttcttgcg gccgctgcta gagattttcc acactacca 39 

<210> 45 
<211> 9738 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 45 

tggaagggtt aatttactcc aggaaaaggc aagagatcct tgatttatgg gtctatcaca 60 
cacaaggcta cttccctgat tggcaaaact acacaccggg accaggggtc agatatccac 120 
tgacctttgg atggtgcttc aagctagtgc cagttgaccc aagggaagta gaagaggcca 180 
acggaggaga agacaactgt ttgctacacc ctatgagcca gtatggaatg gatgatgaac 240 
acaaagaagt gttacagtgg aagtttgaca gcagcctagc acgcagacac ctggcccgcg 300 
agctacatcc ggattattac aaagactgct gacacagaag ggactttccg cctgggactt 360 
tccactgggg cgttccaggg ggagtggtct gggcgggact gggagtggcc agccctcaga 420 
tgctgcatat aagcagcggc ttttcgcctg tactgggtct ctctaggtag accagatccg 480 
agcctgggag ctctctgtct atctggggaa cccactgctt aggcctcaat aaagcttgcc 540 
ttgagtgctc taagtagtgt gtgcccatct gttgtgtgac tctggtaact ctggtaacta 600 
gagatccctc agaccctttg tggtagtgtg gaaaatctct agcagtggcg cccgaacagg 660 
gacttgaaag cgaaagtgag accagagaag atctctcgac gcaggactcg gcttgctgaa 720 
gtgcactcgg caagaggcga ggggggcgac tggtgagtac gccaaaattt tttttgacta 780 
gcggaggcta gaaggagaga gatgggtgcg agagcgtcaa tattaagagg gggaaaatta 840 
gacaaatggg aaaaaattag gttacggcca ggggggagaa aacactatat gctaaaacac 900 
ctagtatggg caagcagaga gctggaaaga tttgcagtta accctggcct tttagagaca 960 
tcagacggat gtagacaaat aataaaacag ctacaaccag ctcttcagac aggaacagag 1020 
gaaattagat cattatttaa cacagtagca actctctatt gtgtacataa agggatagat 1080 
gtacgagaca ccaaggaagc cttagacaag atagaggagg aacaaaacaa atgtcagcaa 1140 
aaaacacagc aggcggaagc ggctgacaaa aaggtcagtc aaaattatcc tatagtgcag 1200 
aacctccaag ggcaaatggt acaccaggcc atatcaccta gaaccttgaa tgcatgggta 1260 
aaagtaatag aggagaaggc ttttagccca gaggtaatac ccatgtttac agcattatca 1320 
gaaggagcca ccccacaaga tttaaacacc atgttaaata cagtgggggg acatcaagca 1380 
gccatgcaaa tgttaaaaga taccatcaat gaggaggctg cagaatggga taggttacat 1440 
ccagtacatg cagggcctgt tgcaccaggc cagatgagag aaccaagggg aagtgacata 1500 
gcaggaacta ctagtaccct tcaagaacaa atagcatgga tgacaagtaa cccacctatc 1560 
ccagtagggg acatctataa aaggtggata attctggggt taaataaaat agtaagaatg 1620 
tacagccctg tcagcatttt agacataaaa caaggaccaa aggaaccctt tagagactat 1680 
gtagaccggt tcttcaaaac tttaagagct gaacaatcta cacaagaggt aaaaaattgg 1740 
atgacagaca ccttgttagt ccaaaatgcg aacccagatt gtaagaccat tttaagagca 1800 
ttaggaccag gggcttcatt agaagaaatg atgacagcat gtcagggagt gggaggacct 1860 
agccacaaag caagagtttt ggctgaggca atgagccaag caaacaatac aagtgtaatg 1920 
atacagaaaa gcaattttaa aggccctaga agagctgtta aatgtttcaa ctgtggcagg 1980 
gaagggcaca tagccaggaa ttgcagggcc cctaggaaaa ggggctgttg gaaatgtgga 2040 
aaggaaggac accaaatgaa agactgtact gagaggcagg ctaatttttt agggaaaatt 2100 
tggccttccc acaaggggag gccagggaat ttccttcaga gcagaccaga gccaacagcc 2160 
ccaccactag aaccaacagc cccaccagca gagagcttca agttcaagga gactccgaag 2220 
caggagccga aagacaggga acctttaact tccctcaaat cactctttgg cagcgacccc 2280 
ttgtctcaat aaaagtagcg ggccaaacaa aggaggctct tttagataca ggagcagatg 2340 
atacagtact agaagaaata aacttgccag gaaaatggaa accaaaaatg ataggaggaa 2400 
ttggaggttt tatcaaagta agacagtatg atcaaatact tatagaaatt tgtggaaaaa 2460 
gggctatagg tacagtatta gtaggaccta cacctgtcaa cataattgga agaaatctgt 2520 
tgactcagct tggatgcaca ctaaattttc caattagccc cattgaaact gtaccagtaa 2580 
aattaaagcc aggaatggat ggcccaaagg ttaaacaatg gccattgaca gaagaaaaaa 2640 
taaaagcatt aacagaaatt tgtgaggaaa tggagaagga aggaaaaatt acaaaaattg 2700 
ggcctgaaaa tccatataac actccagtat ttgccataaa gaagaaggac agtacaaagt 2760 
ggagaaaatt agtagatttc agggaactca ataaaagaac tcaagacttt tgggaagtcc 2820 
aattaggaac accacaccca gcagggttaa aaaagaaaaa atcagtgaca gtactggatg 2880 
tgggagatgc atatttttca gtccctttag atgagagctt cagaaaatat actgcattca 2940 
ccatacctag tataaacaat gaaacaccag ggattagata tcaatataat gttcttccac 3000 
agggatggaa aggatcacca gcaatattcc agagtagcat gacaagaatc ttagagccct 3060 
ttagaacaca aaacccagaa gtagttatct atcaatatat ggatgactta tatgtaggat 3120 
ctgacttaga aatagggcaa catagagcaa aaatagagga gttaagagga cacctattga 3180 
aatggggatt taccacacca gacaagaaac atcagaaaga acccccattt ctttggatgg 3240 
ggtatgaact ccatcctgac aaatggacag tacagcctat acagctgcca gaaaaggaga 3300 
gctggactgt caatgatata cagaagttag tgggaaagtt aaactgggca agtcagattt 3360 
acccagggat taaagtaagg caactgtgta aactccttag gggagccaaa gcactaacag 3420 
acatagtgcc actgactgaa gaagcagaat tagaattggc tgagaacagg gaaattctaa 3480 
aagaaccagt acatggagta tattatgacc catcaaaaga tttaatagct gaaatacaga 3540 
aacaggggaa tgaccaatgg acatatcaaa tttaccaaga accatttaaa aatctgagaa 3600 
caggaaagta tgcaaaaatg aggactgccc acactaatga tgtgaaacag ttagcagagg 3660 
cagtgcaaaa gataacccag gaaagcatag taatatgggg aaaaactcct aaatttagac 3720 
tacccatccc aaaagaaaca tgggagacat ggtggtcaga ctattggcaa gccacctgga 3780 
ttcctgagtg ggagtttgtc aatacccctc ccctagtaaa attgtggtac cagctggaaa 3840 
aagaacccat agtaggggca gaaactttct atgtagatgg agcagccaat agggaaacta 3900 
aaataggaaa agcagggtat gtcactgaca aaggaaggca gaaagttgtt tccttcactg 3960 
aaacaacaaa tcagaagact gaattacaag caattcagct agctttgcag gattcagggc 4020 
cagaagtaaa catagtaaca gactcacagt atgcattagg aatcattcaa gcacaaccag 4080 
ataagagtga atcagaatta gtcagtcaaa taatagaaca gttgataaaa aaggaaaaag 4140 
tctacctatc atgggtacca gcacataaag gaattggagg aaatgaacaa gtagacaaat 4200 
tagtaagtag tggaatcaga aaagtactgt ttctagatgg aatagataaa gctcaagaag 4260 
agcatgaaaa atatcacagc aattggagag caatggctag tgagtttaat ctgccaccca 4320 
tagtagcaaa ggaaatagta gccagctgtg ataaatgtca gctaaaaggg gaagccatgc 4380 
atggacaagt cgactgtagt ccaggaatat ggcaattaga ctgtacacat ttagaaggaa 4440 
aaatcatcct agtagcagtc catgtagcca gtggctacat ggaagcagag gttatcccag 4500 
cagaaacagg acaagaaaca gcatacttta tactaaaatt agcaggaaga tggccagtca 4560 
aagtaataca tacagataat ggcagtaatt tcaccagtac cgcagttaag gcagcctgtt 4620 
ggtgggcaga tatccaacgg gaatttggaa ttccctacaa tccccaaagt caaggagtag 4680 
tagaatccat gaataaagaa ttaaagaaaa tcatagggca agtaagagat caagctgagc 4740 
accttaagac agcagtacaa atggcagtat tcattcacaa ttttaaaaga aaagggggga 4800 
ttggggggta cagtgcaggg gagagaataa tagacataat agcatcagac atacaaacta 4860 
aagaattaca aaaacaaatt ataaaaattc aaaattttcg ggtttattac agagacagca 4920 
gagaccctat ttggaaagga ccagccaaac tactctggaa aggtgaaggg gcagtagtaa 4980 
tacaagataa tagtgatata aaggtagtac caagaaggaa agcaaaaatc attaaggact 5040 
atggaaaaca gatggcaggt gctgattgtg tggcaggtag acaggatgaa gattagaaca 5100 
tggcacagtt tagtaaagca ccatatgtat gtttcgagga gagctgatgg atggttctac 5160 
agacatcatt atgaaagcag acacccaaaa gtaagttcag aagtacacat cccattagga 5220 
gatgccaggt tagtaataaa aacatattgg ggtctgcaga caggagaaag agcttggcat 5280 
ttgggtcacg gagtctccat agaatggaga ttgagaagat atagcacaca agtagaccct 5340 
gacctgacag accaactaat tcatatgcat tattttgatt gttttgcaga atctgccata 5400 
aggaaagcca tactaggaca gatagttagc cctaagtgtg actatcaagc aggacataac 5460 
aaggtaggat ctctacaata cttggcactg acagcattga taaaaccaaa aaagataaag 5520 
ccacctctgc ctagtgttag gaaattagta gaggatagat ggaacaagcc ccagaagacc 5580 
aggggccgca gagggaacca tacaatgaat ggacactaga gcttttagaa gaactcaagc 5640 
aggaagctgt cagacacttt cctagaccat ggctccataa cttaggacaa catatctatg 5700 
aaacctatgg agatacttgg acaggagttg aagcaataat aagaatcctg caacaattac 5760 
tgtttattca tttcaggatt gggtgccatc atagcagaat aggcattttg cgacagagaa 5820 
gagcaagaaa tggagccaat agatcctaac ctagaaccct ggaaccatcc aggaagtcag 5880 
cctaaaactg cttgtaatgg gtgttactgt aaacgttgca gctatcattg tctagtttgc 5940 
tttcagaaaa aaggcttagg catttactat ggcaggaaga agcggagaca gcgacgaagc 6000 
gctcctccaa gcaataaaga tcatcaagat cctctaccaa agcagtaagt accgaatagt 6060 
atatgtaatg ttagatttaa ctgcaagaat agattctaga ttaggaatag gagcattgat 6120 
agtagcacta atcatagcaa taatagtgtg gaccatagta tatatagaat ataggaaatt 6180 
ggtaaggcaa aggaaaatag actggttagt taaaaggatt agggaaagag cagaagacag 6240 
tggcaatgag agcgaggggg atactgaaga attatcgaca ctggtggata tggggcatct 6300 
taggcttttg gatgctaatg atgtgtaatg tgaagggctt gtgggtcaca gtctactacg 6360 
gggtacctgt ggggagagaa gcaaaaacta ctctattttg tgcatcagat gctaaagcat 6420 
atgagaaaga agtgcataat gtctgggcta cacatgcctg tgtacccaca gaccccaacc 6480 
cacaagaagt gattttgggc aatgtaacag aaaattttaa catgtggaaa aatgacatgg 6540 
tggatcagat gcaggaagat ataatcagtt tatgggatca aagccttaag ccatgtgtaa 6600 
aattgacccc actctgtgtc actttaaact gtacaaatgc aactgttaac tacaataata 6660 
cctctaaaga catgaaaaat tgctctttct atgtaaccac agaattaaga gataagaaaa 6720 
agaaagaaaa tgcacttttt tatagacttg atatagtacc acttaataat aggaagaatg 6780 
ggaatattaa caactataga ttaataaatt gtaatacctc agccataaca caagcctgtc 6840 
caaaagtctc gtttgaccca attcctatac attattgtgc tccagctggt tatgcgcctc 6900 
taaaatgtaa taataagaaa ttcaatggaa taggaccatg cgataatgtc agcacagtac 6960 
aatgtacaca tggaattaag ccagtggtat caactcaatt actgttaaat ggtagcctag 7020 
cagaagaaga gataataatt agatctgaaa atctgacaaa caatgtcaaa acaataatag 7080 
tacatcttaa tgaatctata gagattaaat gtacaagacc tggcaataat acaagaaaga 7140 
gtgtgagaat aggaccagga caagcattct atgcaacagg agacataata ggagatataa 7200 
gacaagcaca ttgtaacatt agtaaaaatg aatggaatac aactttacaa agggtaagtc 7260 
aaaaattaca agaactcttc cctaatagta cagggataaa atttgcacca cactcaggag 7320 
gggacctaga aattactaca catagcttta attgtggagg agaatttttc tattgcaata 7380 
caacagacct gtttaatagt acatacagta atggtacatg cactaatggt acatgcatgt 7440 
ctaataatac agagcgcatc acactccaat gcagaataaa acaaattata aacatgtggc 7500 
aggaggtagg acgagcaatg tatgcccctc ccattgcagg aaacataaca tgtagatcaa 7560 
atattacagg actactatta acacgtgatg gaggagataa taatactgaa acagagacat 7620 
tcagacctgg aggaggagac atgagggaca attggagaag tgaattatat aaatacaagg 7680 
tggtagaaat taaaccatta ggagtagcac ccactgctgc aaaaaggaga gtggtggaga 7740 
gagaaaaaag agcagtagga ataggagctg tgttccttgg gttcttggga gcagcaggaa 7800 
gcactatggg cgcagcatca ataacgctga cggtacaggc cagacaatta ttgtctggta 7860 
tagtgcaaca gcaaagtaat ttgctgaggg ctatagaggc gcaacagcat atgttgcaac 7920 
tcacggtctg gggcattaag cagctccagg caagagtcct ggctatagag agatacctac 7980 
aggatcaaca gctcctagga ctgtggggct gctctggaaa actcatctgc accactaatg 8040 
tgctttggaa ctctagttgg agtaataaaa ctcaaagtga tatttgggat aacatgacct 8100 
ggatgcagtg ggatagggaa attagtaatt acacaaacac aatatacagg ttgcttgaag 8160 
actcgcaaag ccagcaggaa agaaatgaaa aagatttact agcattggac aggtggaaca 8220 
atctgtggaa ttggtttagc ataacaaatt ggctgtggta tataaaaata ttcataatga 8280 
tagtaggagg cttgataggt ttaagaataa tttttgctgt gctctctcta gtaaatagag 8340 
ttaggcaggg atactcaccc ttgtcattgc agacccttat cccaaacccg aggggacccg 8400 
acaggctcgg aggaatcgaa gaagaaggtg gagagcaaga cagcagcaga tccattcgat 8460 
tagtgagcgg attcttgaca cttgcctggg acgacctacg aagcctgtgc ctcttctgct 8520 
accaccgatt gagagacttc atattaattg tagtgagagc agtggaactt ctgggacaca 8580 
gtagtctcag gggactgcag agggggtggg gaacccttaa gtatttgggg agtcttgtgc 8640 
aatattgggg tctagagtta aaaaagagtg ctattaatct gcttgatact atagcaatag 8700 
cagtagctga aggaacagat aggattctag aattcataca aaacctttgt agaggtatcc 8760 
gcaacgtacc tagaagaata agacagggct tcgaagcagc tttgcaataa aatggggggc 8820 
aagtggtcaa aaagcagtat aattggatgg cctgaagtaa gagaaagaat cagacgaact 8880 
aggtcagcag cagagggagt aggatcagcg tctcaagact tagagaaaca tggggcactt 8940 
acaaccagca acacagccca caacaatgct gcttgcgcct ggctggaagc gcaagaggag 9000 
gaaggagaag taggctttcc agtcagacct caggtacctt taagaccaat gacttataaa 9060 
gcagcaatag atctcagctt ctttttaaaa gaaaaggggg gactggaagg gttaatttac 9120 
tccaagaaaa ggcaagagat ccttgatttg tgggtttata acacacaagg cttcttccct 9180 
gattggcaaa actacacacc gggaccaggg gtcagatttc cactgacctt tggatggtac 9240 
ttcaagctag agccagtcga tccaagggaa gtagaagagg ccaatgaagg agaaaacaac 9300 
tgtttactac accctatgag ccagcatgga atggaggatg aagacagaga agtattaaga 9360 
tggaagtttg acagtacgct agcacgcaga cacatggccc gcgagctaca tccggagtat 9420 
tacaaagact gctgacacag aagggacttt ccgctgggac tttccactgg ggcgttccag 9480 
gaggtgtggt ctgggcggga caggggagtg gtcagccctg agatgctgca tataagcagc 9540 
tgcttttcgc ctgtactggg tctctctagg tagaccagat ctgagcccgg gagctctctg 9600 
gctatctagg gaacccactg cttaagcctc aataaagctt gccttgagtg ccttgagtag 9660 
tgtgtgcccg tctgttgtgt gactctggta actagagatc cctcagacca cttgtggtag 9720 
tgtggaaaat ctctagca 9738 

<210> 46 
<211> 97 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Env Optimized common region short 

<400> 46 

catcaccctg cagtgcaaga tcaagcagat cgtgcgcatg tggcagggcg tgggccaggc 60 
catgtacgcc ccccccatcg ccggcaacat cacctgc 97 

<210> 47 
<211> 144 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C Env 
Optimized common region 

<400> 47 

ctgcccatca ccctgcagtg caagatcaag cagatcgtgc gcatgtggca gggcgtgggc 60 
caggccatgt acgccccccc catcgccggc aacatcacct gccgcagcaa catcaccggc 120 
atcctgctga cccgcgacgg cggc 144 

<210> 48 
<211> 144 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C Env 
wild type common region 

<400> 48 

ttacccatca cactccaatg caaaataaaa caaattgtac gcatgtggca aggggtagga 60 
caagcaatgt atgcccctcc cattgcagga aacataacat gtagatcaaa catcacagga 120 
atactattga cacgtgatgg ggga 144 

<210> 49 
<211> 2610 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Envgp160 optimized 

<400> 49 

atgcgcgtga tgggcaccca gaagaactgc cagcagtggt ggatctgggg catcctgggc 60 
ttctggatgc tgatgatctg caacaccgag gacctgtggg tgaccgtgta ctacggcgtg 120 
cccgtgtggc gcgaggccaa gaccaccctg ttctgcgcca gcgacgccaa ggcctacgag 180 
accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 240 
gagatcgtgc tgggcaacgt gaccgagaac ttcaacatgt ggaagaacaa catggccgac 300 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 360 
acccccctgt gcgtgaccct gaactgcacc gacaccaacg tgaccggcaa ccgcaccgtg 420 
accggcaaca ccaacgacac caacatcgcc aacgccacct acaagtacga ggagatgaag 480 
aactgcagct tcaacgccac caccgagctg cgcgacaaga agcacaagga gtacgccctg 540 
ttctacaagc tggacatcgt gcccctgaac gagaacagca acaacttcac ctaccgcctg 600 
atcaactgca acaccagcac catcacccag gcctgcccca aggtgagctt cgaccccatc 660 
cccatccact actgcgcccc cgccgactac gccatcctga agtgcaacaa caagaccttc 720 
aacggcaccg gcccctgcta caacgtgagc accgtgcagt gcacccacgg catcaagccc 780 
gtggtgagca cccagctgct gctgaacggc agcctggccg aggagggcat catcatccgc 840 
agcgagaacc tgaccgagaa caccaagacc atcatcgtgc acctgaacga gagcgtggag 900 
atcaactgca cccgccccaa caacaacacc cgcaagagcg tgcgcatcgg ccccggccag 960 
gccttctacg ccaccaacga cgtgatcggc aacatccgcc aggcccactg caacatcagc 1020 
accgaccgct ggaacaagac cctgcagcag gtgatgaaga agctgggcga gcacttcccc 1080 
aacaagacca tcaagttcga gccccacgcc ggcggcgacc tggagatcac catgcacagc 1140 
ttcaactgcc gcggcgagtt cttctactgc aacaccagca acctgttcaa cagcacctac 1200 
taccccaaga acggcaccta caagtacaac ggcaacagca gcctgcccat caccctgcag 1260 
tgcaagatca agcagatcgt gcgcatgtgg cagggcgtgg gccaggccat gtacgccccc 1320 
cccatcgccg gcaacatcac ctgccgcagc aacatcaccg gcatcctgct gacccgcgac 1380 
ggcggcttca acaacaccaa caacgacacc gaggagacct tccgccccgg cggcggcgac 1440 
atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500 
ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcaagaagcg cgccgtgggc 1560 
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620 
atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680 
ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740 
cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800 
atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860 
agcaacaaga gcgaggccga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920 
atcaacaact acaccgagac catcttccgc ctgctggagg acagccagaa ccagcaggag 1980 
aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040 
atcagcaact ggctgtggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 2100 
ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2160 
ctgagcttcc agaccctgac ccccagcccc cgcggcctgg accgcctggg cggcatcgag 2220 
gaggagggcg gcgagcagga ccgcgaccgc agcatccgcc tggtgagcgg cttcctgagc 2280 
ctggcctggg acgacctgcg cagcctgtgc ctgttcagct accaccgcct gcgcgacttc 2340 
atcctgatcg ccgtgcgcgc cgtggagctg ctgggccaca gcagcctgcg cggcctgcag 2400 
cgcggctggg agatcctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2460 
aagaagagcg ccatcagccc cctggacacc atcgccatcg ccgtggccga gggcaccgac 2520 
cgcatcatcg agctggtgca gcgcatctgc cgcgccatcc tgaacatccc ccgccgcatc 2580 
cgccagggct tcgaggccgc cctgctgtaa 2610 

<210> 50 
<211> 2610 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Envgp160 wild type 

<400> 50 

atgagagtga tggggacaca gaagaattgt caacaatggt ggatatgggg catcttaggc 60 
ttctggatgc taatgatttg taacacggag gacttgtggg tcacagtcta ctatggggta 120 
cctgtgtgga gagaagcaaa aactactcta ttctgtgcat cagatgctaa agcatatgag 180 
acagaagtgc ataatgtctg ggctacacat gcttgtgtac ccacagaccc caacccacaa 240 
gaaatagttt tgggaaatgt aacagaaaat tttaatatgt ggaaaaataa catggcagat 300 
cagatgcatg aggatataat cagtttatgg gatcaaagcc taaagccatg tgtaaagttg 360 
accccactct gtgtcacttt aaactgtaca gatacaaatg ttacaggtaa tagaactgtt 420 
acaggtaata caaatgatac caatattgca aatgctacat ataagtatga agaaatgaaa 480 
aattgctctt tcaatgcaac cacagaatta agagataaga aacataaaga gtatgcactc 540 
ttttataaac ttgatatagt accacttaat gaaaatagta acaactttac atatagatta 600 
ataaattgca atacctcaac cataacacaa gcctgtccaa aggtctcttt tgacccgatt 660 
cctatacatt actgtgctcc agctgattat gcgattctaa agtgtaataa taagacattc 720 
aatgggacag gaccatgtta taatgtcagc acagtacaat gtacacatgg aattaagcca 780 
gtggtatcaa ctcaactact gttaaatggt agtctagcag aagaagggat aataattaga 840 
tctgaaaatt tgacagagaa taccaaaaca ataatagtac atcttaatga atctgtagag 900 
attaattgta caaggcccaa caataataca aggaaaagtg taaggatagg accaggacaa 960 
gcattctatg caacaaatga cgtaatagga aacataagac aagcacattg taacattagt 1020 
acagatagat ggaataaaac tttacaacag gtaatgaaaa aattaggaga gcatttccct 1080 
aataaaacaa taaaatttga accacatgca ggaggggatc tagaaattac aatgcatagc 1140 
tttaattgta gaggagaatt tttctattgc aatacatcaa acctgtttaa tagtacatac 1200 
taccctaaga atggtacata caaatacaat ggtaattcaa gcttacccat cacactccaa 1260 
tgcaaaataa aacaaattgt acgcatgtgg caaggggtag gacaagcaat gtatgcccct 1320 
cccattgcag gaaacataac atgtagatca aacatcacag gaatactatt gacacgtgat 1380 
gggggattta acaacacaaa caacgacaca gaggagacat tcagacctgg aggaggagat 1440 
atgagggata actggagaag tgaattatat aaatataaag tggtagaaat taagccattg 1500 
ggaatagcac ccactaaggc aaaaagaaga gtggtgcaga gaaaaaaaag agcagtggga 1560 
ataggagctg tgttccttgg gttcttggga gcagcaggaa gcactatggg cgcagcgtca 1620 
ataacgctga cggtacaggc cagacaactg ttgtctggta tagtgcaaca gcaaagcaat 1680 
ttgctgaagg ctatagaggc gcaacagcat atgttgcaac tcacagtctg gggcattaag 1740 
cagctccagg cgagagtcct ggctatagaa agatacctaa aggatcaaca gctcctaggg 1800 
atttggggct gctctggaag actcatctgc accactgctg tgccttggaa ctccagttgg 1860 
agtaataaat ctgaagcaga tatttgggat aacatgactt ggatgcagtg ggatagagaa 1920 
attaataatt acacagaaac aatattcagg ttgcttgaag actcgcaaaa ccagcaggaa 1980 
aagaatgaaa aagatttatt agaattggac aagtggaata atctgtggaa ttggtttgac 2040 
atatcaaact ggctgtggta tataaaaata ttcataatga tagtaggagg cttgataggt 2100 
ttaagaataa tttttgctgt gctctctata gtgaatagag ttaggcaggg atactcacct 2160 
ttgtcatttc agacccttac cccaagcccg aggggactcg acaggctcgg aggaatcgaa 2220 
gaagaaggtg gagagcaaga cagagacaga tccatacgat tggtgagcgg attcttgtcg 2280 
cttgcctggg acgatctgcg gagcctgtgc ctcttcagct accaccgctt gagagacttc 2340 
atattaattg cagtgagggc agtggaactt ctgggacaca gcagtctcag gggactacag 2400 
agggggtggg agatccttaa gtatctggga agtcttgtgc agtattgggg tctagagcta 2460 
aaaaagagtg ctattagtcc gcttgatacc atagcaatag cagtagctga aggaacagat 2520 
aggattatag aattggtaca aagaatttgt agagctatcc tcaacatacc taggagaata 2580 
agacagggct ttgaagcagc tttgctataa 2610 

<210> 51 
<211> 1494 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Gag optimized 

<400> 51 

atgggcgccc gcgccagcat cctgagcggc ggcaagctgg acaagtggga gcgcatccgc 60 
ctgcgccccg gcggcaagaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120 
ctggagcgct tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 180 
atcaagcagc tgcagcccgc cctgcagacc ggcaccgagg agctgcgcag cctgttcaac 240 
accgtggcca ccctgtactg cgtgcacaag ggcatcgagg tgcgcgacac caaggaggcc 300 
ctggacaaga tcgaggagga gcagaacaag tgccagcaga aggcccagca ggccaaggcc 360 
gccgacgaga aggtgagcca gaactacccc atcgtgcaga acgcccaggg ccagatggtg 420 
caccaggcca tcagcccccg caccctgaac gcctggatca aggtgatcga ggagaaggcc 480 
ttcaaccccg aggagatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540 
ctgaacacca tgctgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600 
accatcaacg aggaggccgc cgagtgggac cgcacccacc ccgtgcacgc cggccccgtg 660 
gcccccggcc agatgcgcga gccccgcggc agcgacatcg ccggcaccac cagcaccctg 720 
caggagcaga tcgcctggat gaccagcaac ccccccatcc ccgtggagga catctacaag 780 
cgctggatca tcctgggcct gaacaagatc gtgcgcatgt acagccccgt gagcatcctg 840 
gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900 
ctgcgcgccg agcaggccac ccaggacgtg aagaactgga tgaccgacac cctgctggtg 960 
cagaacgcca accccgactg caagaccatc ctgcgcgccc tgggccccgg cgccagcctg 1020 
gaggagatga tgaccgcctg ccagggcgtg ggcggcccca gccacaaggc ccgcgtgctg 1080 
gccgaggcca tgagccaggc caacagcaac atcctggtgc agcgcagcaa cttcaagggc 1140 
agcaaccgca tcatcaagtg cttcaactgc ggcaaggtgg gccacatcgc ccgcaactgc 1200 

cgcgcccccc gcaagaaggg ctgctggaag tgcggccagg agggccacca gatgaaggac 1260 

tgcaccgagc gccaggccaa cttcctgggc aagatctggc ccagccacaa gggccgcccc 1320 

ggcaacttcc tgcagaaccg ccccgagccc accgcccccc ccgccgagcc caccgccccc 1380 

cccgccgaga gcttccgctt cgaggagacc acccccgtgc cccgcaagga gaaggagcgc 1440 

gagcccctga ccagcctgaa gagcctgttc ggcagcgacc ccctgagcca gtaa 1494 

<210> 52 
<211> 1494 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C Gag 
Wild Type 

<400> 52 

atgggtgcga gagcgtcaat attaagcggc ggaaaattag ataaatggga aagaattagg 60 
ttaaggccag ggggaaagaa acattatatg ttaaaacatc tagtatgggc aagcagggag 120 
ctggaaagat ttgcacttaa ccctggcctg ttagaaacat cagaaggctg taaacaaata 180 
ataaaacagc tacaaccagc tcttcagaca ggaacagagg aacttagatc attattcaac 240 
acagtagcaa ctctctattg tgtacataaa gggatagagg tacgagacac caaggaagcc 300 
ttagacaaga tagaggaaga acaaaacaaa tgtcagcaaa aagcacaaca ggcaaaagca 360 
gctgacgaaa aggtcagtca aaattatcct atagtacaga atgcccaagg gcaaatggta 420 
caccaagcta tatcacctag aacattgaat gcatggataa aagtaataga ggaaaaggct 480 
ttcaatccag aggaaatacc catgtttaca gcattatcag aaggagccac cccacaagat 540 
ttaaacacaa tgttaaatac agtgggggga catcaagcag ccatgcaaat gttaaaagat 600 
accatcaatg aggaggctgc agaatgggat aggacacatc cagtacatgc agggcctgtt 660 
gcaccaggcc agatgagaga accaagggga agtgacatag caggaactac tagtaccctt 720 
caggaacaaa tagcatggat gacaagtaat ccacctattc cagtagaaga catctataaa 780 
agatggataa ttctggggtt aaataaaata gtaagaatgt atagccctgt tagcattttg 840 
gacataaaac aagggccaaa agaacccttt agagactatg tagaccggtt ctttaaaacc 900 
ttaagagctg aacaagctac acaagatgta aagaattgga tgacagacac cttgttggtc 960 
caaaatgcga acccagattg taagaccatt ttaagagcat taggaccagg ggcctcatta 1020 
gaagaaatga tgacagcatg tcagggagtg ggaggaccta gccataaagc aagagtgttg 1080 
gctgaggcaa tgagccaagc aaacagtaac atactagtgc agagaagcaa ttttaaaggc 1140 
tctaacagaa ttattaaatg tttcaactgt ggcaaagtag ggcacatagc cagaaattgc 1200 
agggccccta ggaaaaaggg ctgttggaaa tgtggacagg aaggacacca aatgaaagac 1260 
tgtactgaga ggcaggctaa ttttttaggg aaaatttggc cttcccacaa ggggaggcca 1320 
gggaatttcc tccagaacag accagagcca acagccccac cagcagaacc aacagcccca 1380 
ccagcagaga gcttcaggtt cgaggagaca acccccgtgc cgaggaagga gaaagagagg 1440 
gaacctttaa cttccctcaa atcactcttt ggcagcgacc ccttgtctca ataa 1494 

<210> 53 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C Gag 
Major Homology Region Optimized 

<400> 53 

gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 

<210> 54 
<211> 60 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: HIV Type C Gag 
Major Homology Region Wild Type 

<400> 54 

gacataaaac aagggccaaa agaacccttt agagactatg tagaccggtt ctttaaaacc 60 

<210> 55 
<211> 624 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C Nef 
Optimized 

<400> 55 

atgggcggca agtggagcaa gcgcagcatc gtgggctggc ccgccgtgcg cgagcgcatg 60 
cgccgcaccg agcccgccgc cgagggcgtg ggcgccgcca gccaggacct ggaccgccac 120 
ggcgccctga ccagcagcaa cacccccgcc accaacgagg cctgcgcctg gctgcaggcc 180 
caggaggagg acggcgacgt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 
acctacaaga gcgccgtgga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 
ctgatctaca gccgcaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 
ttcttccccg actggcagaa ctacaccagc ggccccggcg tgcgcttccc cctgaccttc 420 
ggctggtgct tcaagctggt gcccgtggac ccccgcgagg tgaaggaggc caacgagggc 480 
gaggacaact gcctgctgca ccccatgagc cagcacggcg ccgaggacga ggaccgcgag 540 
gtgctgaagt ggaagttcga cagcctgctg gcccaccgcc acatggcccg cgagctgcac 600 
cccgagtact acaaggactg ctga 624 

<210> 56 
<211> 624 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C Nef 
Wild Type 

<400> 56 

atgggaggca agtggtcaaa acgcagcata gttggatggc ctgcagtaag agaaagaatg 60 
agaagaactg agccagcagc agagggagta ggagcagcgt ctcaagactt agatagacat 120 
ggggcactta caagcagcaa cacacctgct actaatgaag cttgtgcctg gctgcaagca 180 
caagaggagg acggagatgt aggctttcca gtcagacctc aggtaccttt aagaccaatg 240 
acttataaga gtgcagtaga tctcagcttc tttttaaaag aaaagggggg actggaaggg 300 
ttaatttact ctaggaaaag gcaagaaatc cttgatttgt gggtctataa cacacaaggc 360 
ttcttccctg attggcaaaa ctacacatcg gggccagggg tccgattccc actgaccttt 420 
ggatggtgct tcaagctagt accagttgac ccaagggagg tgaaagaggc caatgaagga 480 
gaagacaact gtttgctaca ccctatgagc caacatggag cagaggatga agatagagaa 540 
gtattaaagt ggaagtttga cagccttcta gcacacagac acatggcccg cgagctacat 600 
ccggagtatt acaaagactg ctga 624 

<210> 57 
<211> 624 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
NefD125G Optimized 

<400> 57 

atgggcggca agtggagcaa gcgcagcatc gtgggctggc ccgccgtgcg cgagcgcatg 60 
cgccgcaccg agcccgccgc cgagggcgtg ggcgccgcca gccaggacct ggaccgccac 120 

ggcgccctga ccagcagcaa cacccccgcc accaacgagg cctgcgcctg gctgcaggcc 180 

caggaggagg acggcgacgt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 

acctacaaga gcgccgtgga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 

ctgatctaca gccgcaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 

ttcttccccg gctggcagaa ctacaccagc ggccccggcg tgcgcttccc cctgaccttc 420 

ggctggtgct tcaagctggt gcccgtggac ccccgcgagg tgaaggaggc caacgagggc 480 

gaggacaact gcctgctgca ccccatgagc cagcacggcg ccgaggacga ggaccgcgag 540 

gtgctgaagt ggaagttcga cagcctgctg gcccaccgcc acatggcccg cgagctgcac 600 
cccgagtact acaaggactg ctga 624 

<210> 58 
<211> 354 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
p15RNaseH Optimized 

<400> 58 

accttctacg tggacggcgc caccaaccgc gaggccaaga tcggcaaggc cggctacgtg 60 

accgaccgcg gccgccagaa gatcgtgacc ctgaccaaca ccaccaacca gaagaccgag 120 

ctgcaggcca tccagctggc cctgcaggac agcggcagcg aggtgaacat cgtgaccgac 180 

agccagtacg ccctgggcat catccaggcc cagcccgaca agagcgacag cgagatcttc 240 

aaccagatca tcgagcagct gatcaacaag gagcgcatct acctgagctg ggtgcccgcc 300 

cacaagggca tcggcggcaa cgagcaggtg gacaagctgg tgagcaaggg catc 354 

<210> 59 
<211> 354 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
p15RNaseH Wild Type 

<400> 59 

actttctatg tagatggagc aactaatagg gaagctaaaa taggaaaagc agggtatgtt 60 
actgacagag gaaggcagaa aattgttact ctaactaaca caacaaatca gaagactgag 120 
ttacaagcaa ttcagctagc tctgcaggat tcaggatcag aagtaaacat agtaacagac 180 
tcacagtatg cattaggaat cattcaagca caaccagata agagtgactc agagatattt 240 
aaccaaataa tagaacagtt aataaacaag gaaagaatct acctgtcatg ggtaccagca 300 
cataaaggaa ttgggggaaa tgaacaagta gataaattag taagtaaggg aatt 354 

<210> 60 
<211> 876 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
p31Int Optimized 

<400> 60 

cgcaaggtgc tgttcctgga cggcatcgac aaggcccagg aggagcacga gcgctaccac 60 
agcaactggc gcgccatggc caacgagttc aacctgcccc ccatcgtggc caaggagatc 120 
gtggccagct gcgacaagtg ccagctgaag ggcgaggcca tccacggcca ggtggactgc 180 
agccccggca tctggcagct ggactgcacc cacctggagg gcaagatcat cctggtggcc 240 
gtgcacgtgg ccagcggcta catggaggcc gaggtgatcc ccgccgagac cggccaggag 300 
accgcctact tcatcctgaa gctggccggc cgctggcccg tgaaggtgat ccacaccgac 360 
aacggcagca acttcaccag caccgccgtg aaggccgcct gctggtgggc cggcatccag 420 
caggagttcg gcatccccta caacccccag agccagggcg tggtggagag catgaacaag 480 
gagctgaaga agatcatcgg ccaggtgcgc gaccaggccg agcacctgaa gaccgccgtg 540 
cagatggccg tgttcatcca caacttcaag cgcaagggcg gcatcggcgg ctacagcgcc 600 
ggcgagcgca tcatcgacat catcgccacc gacatccaga ccaaggagct gcagaagcag 660 
atcatccgca tccagaactt ccgcgtgtac taccgcgaca gccgcgaccc catctggaag 720 
ggccccgccg agctgctgtg gaagggcgag ggcgtggtgg tgatcgagga caagggcgac 780 
atcaaggtgg tgccccgccg caaggccaag atcatccgcg actacggcaa gcagatggcc 840 
ggcgccgact gcgtggccgg cggccaggac gaggac 876 

<210> 61 
<211> 876 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
p31Int Wild Type 

<400> 61 

aggaaagtgt tgtttctaga tggaatagat aaagctcaag aagagcatga aaggtaccac 60 
agcaattgga gagcaatggc taatgagttt aatctgccac ccatagtagc aaaagaaata 120 
gtagctagct gtgataaatg tcagctaaaa ggggaagcca tacatggaca agtcgactgt 180 
agtccaggga tatggcaatt agattgtacc catttagagg gaaaaatcat cctggtagca 240 
gtccatgtag ctagtggcta catggaagca gaggttatcc cagcagaaac aggacaagaa 300 
acagcatatt ttatattaaa attagcagga agatggccag tcaaagtaat acatacagac 360 
aatggcagta attttaccag tactgcagtt aaggcagcct gttggtgggc aggtatccaa 420 
caggaatttg gaattcccta caatccccaa agtcagggag tggtagaatc catgaataaa 480 
gaattaaaga aaataatagg acaagtaaga gatcaagctg agcaccttaa gacagcagta 540 
caaatggcag tattcattca caattttaaa agaaaagggg gaattggggg gtacagtgca 600 
ggggaaagaa taatagacat aatagcaaca gacatacaaa ctaaagaatt acaaaaacaa 660 
attataagaa ttcaaaattt tcgggtttat tacagagaca gcagagaccc tatttggaaa 720 
ggaccagccg aactactctg gaaaggtgaa ggggtagtag taatagaaga taaaggtgac 780 
ataaaggtag taccaaggag gaaagcaaaa atcattagag attatggaaa acagatggca 840 
ggtgctgatt gtgtggcagg tggacaggat gaagat 876 

<210> 62 
<211> 3015 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Pol Optimized 

<400> 62 

ttcttccgcg agaacctggc cttcccccag ggcgaggccc gcgagttccc ccccgagcag 60 
acccgcgcca acagccccac cagccgcacc aacagcccca ccagccgcga gctgcaggtg 120 
cgcggcgaca acccccgcgc cgaggagggc gagcgcgagg gcaccttcaa cttcccccag 180 
atcaccctgt ggcagcgccc cctggtgagc atcaaggtgg agggccagat caaggaggcc 240 
ctgctggaca ccggcgccga cgacaccgtg ctggaggaga tcgacctgcc cggcaagtgg 300 
aagcccaaga tgatcggcgg catcggcggc ttcatcaagg tgcgccagta cgaccagatc 360 
ctgatcgaga tctgcggcaa gaaggccatc ggcaccgtgc tggtgggccc cacccccgtg 420 
aacatcatcg gccgcaacct gctgacccag ctgggctgca ccctgaactt ccccatcagc 480 
cccatcgaga ccgtgcccgt gaagctgaag cccggcatgg acggccccaa ggtgaagcag 540 
tggcccctga ccgaggagaa gatcaaggcc ctgaccgcca tctgcgagga gatggagaag 600 
gagggcaaga tcaccaagat cggccccgac aacccctaca acacccccgt gttcgccatc 660 
aagaagaagg acagcaccaa gtggcgcaag ctggtggact tccgcgagct gaacaagcgc 720 
acccaggact tctgggaggt gcagctgggc atcccccacc ccgccggcct gaagaagaag 780 
aagagcgtga ccgtgctgga cgtgggcgac gcctacttca gcgtgcccct ggacgagagc 840 
ttccgcaagt acaccgcctt caccatcccc agcatcaaca acgagacccc cggcatccgc 900 
taccagtaca acgtgctgcc ccagggctgg aagggcagcc ccgccatctt ccagagcagc 960 
atgaccaaga tcctggagcc cttccgcgcc aagaaccccg acatcgtgat ctaccagtac 1020 
atggacgacc tgtacgtggg cagcgacctg gagatcggcc agcaccgcgc caagatcgag 1080 
gagctgcgcg agcacctgct gaagtggggc ttcaccaccc ccgacaagaa gcaccagaag 1140 
gagcccccct tcctgtggat gggctacgag ctgcaccccg acaagtggac cgtgcagccc 1200 
atcctgctgc ccgagaagga cagctggacc gtgaacgaca tccagaagct ggtgggcaag 1260 
ctgaactggg ccagccagat ctaccccggc atcaaggtgc gccagctgtg caagctgctg 1320 
cgcggcgcca aggccctgac cgacatcgtg cccctgaccg aggaggccga gctggagctg 1380 
gccgagaacc gcgagatcct gcgcgagccc gtgcacggcg tgtactacga ccccagcaag 1440 
gacctgatcg ccgagatcca gaagcagggc cacgagcagt ggacctacca gatctaccag 1500 
gagcccttca agaacctgaa gaccggcaag tacgccaaga tgcgcaccac ccacaccaac 1560 
gacgtgaagc agctgaccga ggccgtgcag aagatcgcca tggagagcat cgtgatctgg 1620 
ggcaagaccc ccaagttccg cctgcccatc cagaaggaga cctgggagac ctggtggacc 1680 
gactactggc aggccacctg gatccccgag tgggagttcg tgaacacccc ccccctggtg 1740 
aagctgtggt accagctgga gaaggacccc atcgccggcg tggagacctt ctacgtggac 1800 
ggcgccacca accgcgaggc caagatcggc aaggccggct acgtgaccga ccgcggccgc 1860 
cagaagatcg tgaccctgac caacaccacc aaccagaaga ccgagctgca ggccatccag 1920 
ctggccctgc aggacagcgg cagcgaggtg aacatcgtga ccgacagcca gtacgccctg 1980 
ggcatcatcc aggcccagcc cgacaagagc gacagcgaga tcttcaacca gatcatcgag 2040 
cagctgatca acaaggagcg catctacctg agctgggtgc ccgcccacaa gggcatcggc 2100 
ggcaacgagc aggtggacaa gctggtgagc aagggcatcc gcaaggtgct gttcctggac 2160 
ggcatcgaca aggcccagga ggagcacgag cgctaccaca gcaactggcg cgccatggcc 2220 
aacgagttca acctgccccc catcgtggcc aaggagatcg tggccagctg cgacaagtgc 2280 
cagctgaagg gcgaggccat ccacggccag gtggactgca gccccggcat ctggcagctg 2340 
gactgcaccc acctggaggg caagatcatc ctggtggccg tgcacgtggc cagcggctac 2400 
atggaggccg aggtgatccc cgccgagacc ggccaggaga ccgcctactt catcctgaag 2460 
ctggccggcc gctggcccgt gaaggtgatc cacaccgaca acggcagcaa cttcaccagc 2520 
accgccgtga aggccgcctg ctggtgggcc ggcatccagc aggagttcgg catcccctac 2580 
aacccccaga gccagggcgt ggtggagagc atgaacaagg agctgaagaa gatcatcggc 2640 
caggtgcgcg accaggccga gcacctgaag accgccgtgc agatggccgt gttcatccac 2700 
aacttcaagc gcaagggcgg catcggcggc tacagcgccg gcgagcgcat catcgacatc 2760 
atcgccaccg acatccagac caaggagctg cagaagcaga tcatccgcat ccagaacttc 2820 
cgcgtgtact accgcgacag ccgcgacccc atctggaagg gccccgccga gctgctgtgg 2880 
aagggcgagg gcgtggtggt gatcgaggac aagggcgaca tcaaggtggt gccccgccgc 2940 
aaggccaaga tcatccgcga ctacggcaag cagatggccg gcgccgactg cgtggccggc 3000 
ggccaggacg aggac 3015 

<210> 63 
<211> 3015 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Pol Wild Type 



<400> 63 

ttttttaggg aaaatttggc cttcccacaa ggggaggcca gggaatttcc tccagaacag 60 
accagagcca acagccccac cagcagaacc aacagcccca ccagcagaga gcttcaggtt 120 
cgaggagaca acccccgtgc cgaggaagga gaaagagagg gaacctttaa cttccctcaa 180 
atcactcttt ggcagcgacc ccttgtctca ataaaagtag agggccagat aaaggaggct 240 
ctcttagaca caggagcaga tgatacagta ttagaagaaa tagatttgcc agggaaatgg 300 
aaaccaaaaa tgataggggg aattggaggt tttatcaaag taagacagta tgatcaaata 360 
cttatagaaa tttgtggaaa aaaggctata ggtacagtat tagtagggcc tacaccagtc 420 
aacataattg gaagaaatct gttaactcag cttggatgca cactaaattt tccaattagt 480 
cctattgaaa ctgtaccagt aaaattaaaa ccaggaatgg atggcccaaa ggtcaaacaa 540 
tggccattga cagaagaaaa aataaaagca ttaacagcaa tttgtgagga aatggagaag 600 
gaaggaaaaa ttacaaaaat tgggcctgat aatccatata acactccagt atttgccata 660 
aaaaagaagg acagtactaa gtggagaaaa ttagtagatt tcagggaact caataaaaga 720 
actcaagact tttgggaagt tcaattagga ataccacacc cagcaggatt aaaaaagaaa 780 
aaatcagtga cagtgctaga tgtgggggat gcatattttt cagttccttt agatgaaagc 840 
ttcaggaaat atactgcatt caccatacct agtataaaca atgaaacacc agggattaga 900 
tatcaatata atgtgctgcc acagggatgg aaaggatcac cagcaatatt ccagagtagc 960 
atgacaaaaa tcttagagcc cttcagagca aaaaatccag acatagttat ctatcaatat 1020 
atggatgact tgtatgtagg atctgactta gaaatagggc aacatagagc aaaaatagaa 1080 
gagttaaggg aacatttatt gaaatgggga tttacaacac cagacaagaa acatcaaaaa 1140 
gaacccccat ttctttggat ggggtatgaa ctccatcctg acaaatggac agtacaacct 1200 
atactgctgc cagaaaagga tagttggact gtcaatgata tacagaagtt agtgggaaaa 1260 
ttaaactggg caagtcagat ttacccaggg attaaagtaa ggcaactctg taaactcctc 1320 
aggggggcca aagcactaac agacatagta ccactaactg aagaagcaga attagaattg 1380 
gcagagaaca gggaaatttt aagagaacca gtacatggag tatattatga tccatcaaaa 1440 
gacttgatag ctgaaataca gaaacagggg catgaacaat ggacatatca aatttatcaa 1500 
gaaccattta aaaatctgaa aacagggaag tatgcaaaaa tgaggactac ccacactaat 1560 
gatgtaaaac agttaacaga ggcagtgcaa aaaatagcca tggaaagcat agtaatatgg 1620 
ggaaagactc ctaaatttag actacccatc caaaaagaaa catgggagac atggtggaca 1680 
gactattggc aagccacctg gatccctgag tgggagtttg ttaatacccc tcccctagta 1740 
aaattatggt accaactaga aaaagatccc atagcaggag tagaaacttt ctatgtagat 1800 
ggagcaacta atagggaagc taaaatagga aaagcagggt atgttactga cagaggaagg 1860 
cagaaaattg ttactctaac taacacaaca aatcagaaga ctgagttaca agcaattcag 1920 
ctagctctgc aggattcagg atcagaagta aacatagtaa cagactcaca gtatgcatta 1980 
ggaatcattc aagcacaacc agataagagt gactcagaga tatttaacca aataatagaa 2040 
cagttaataa acaaggaaag aatctacctg tcatgggtac cagcacataa aggaattggg 2100 
ggaaatgaac aagtagataa attagtaagt aagggaatta ggaaagtgtt gtttctagat 2160 
ggaatagata aagctcaaga agagcatgaa aggtaccaca gcaattggag agcaatggct 2220 
aatgagttta atctgccacc catagtagca aaagaaatag tagctagctg tgataaatgt 2280 
cagctaaaag gggaagccat acatggacaa gtcgactgta gtccagggat atggcaatta 2340 
gattgtaccc atttagaggg aaaaatcatc ctggtagcag tccatgtagc tagtggctac 2400 
atggaagcag aggttatccc agcagaaaca ggacaagaaa cagcatattt tatattaaaa 2460 
ttagcaggaa gatggccagt caaagtaata catacagaca atggcagtaa ttttaccagt 2520 
actgcagtta aggcagcctg ttggtgggca ggtatccaac aggaatttgg aattccctac 2580 
aatccccaaa gtcagggagt ggtagaatcc atgaataaag aattaaagaa aataatagga 2640 
caagtaagag atcaagctga gcaccttaag acagcagtac aaatggcagt attcattcac 2700 
aattttaaaa gaaaaggggg aattgggggg tacagtgcag gggaaagaat aatagacata 2760 
atagcaacag acatacaaac taaagaatta caaaaacaaa ttataagaat tcaaaatttt 2820 
cgggtttatt acagagacag cagagaccct atttggaaag gaccagccga actactctgg 2880 
aaaggtgaag gggtagtagt aatagaagat aaaggtgaca taaaggtagt accaaggagg 2940 
aaagcaaaaa tcattagaga ttatggaaaa cagatggcag gtgctgattg tgtggcaggt 3000 
ggacaggatg aagat 3015 

<210> 64 
<211> 297 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Protease Optimized 

<400> 64 

ccccagatca ccctgtggca gcgccccctg gtgagcatca aggtggaggg ccagatcaag 60 

gaggccctgc tggacaccgg cgccgacgac accgtgctgg aggagatcga cctgcccggc 120 

aagtggaagc ccaagatgat cggcggcatc ggcggcttca tcaaggtgcg ccagtacgac 180 

cagatcctga tcgagatctg cggcaagaag gccatcggca ccgtgctggt gggccccacc 240 

cccgtgaaca tcatcggccg caacctgctg acccagctgg gctgcaccct gaacttc 297 

<210> 65 
<211> 297 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Protease Wild Type 

<400> 65 

cctcaaatca ctctttggca gcgacccctt gtctcaataa aagtagaggg ccagataaag 60 
gaggctctct tagacacagg agcagatgat acagtattag aagaaataga tttgccaggg 120 
aaatggaaac caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat 180 
caaatactta tagaaatttg tggaaaaaag gctataggta cagtattagt agggcctaca 240 
ccagtcaaca taattggaag aaatctgtta actcagcttg gatgcacact aaatttt 297 

<210> 66 
<211> 297 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Inactivated Protease Optimized 

<400> 66 

ccccagatca ccctgtggca gcgccccctg gtgagcatca aggtggaggg ccagatcaag 60 
gaggccctgc tggccaccgg cgccgacgac accgtgctgg aggagatcga cctgcccggc 120 
aagtggaagc ccaagatgat cggcggcatc ggcggcttca tcaaggtgcg ccagtacgac 180 
cagatcctga tcgagatctg cggcaagaag gccatcggca ccgtgctggt gggccccacc 240 
cccgtgaaca tcatcggccg caacctgctg acccagctgg gctgcaccct gaacttc 297 

<210> 67 
<211> 297 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Inactivated Protease Wild Type 

<400> 67 

cctcaaatca ctctttggca gcgacccctt gtctcaataa aagtagaggg ccagataaag 60 
gaggctctct tagccacagg agcagatgat acagtattag aagaaataga tttgccaggg 120 
aaatggaaac caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat 180 
caaatactta tagaaatttg tggaaaaaag gctataggta cagtattagt agggcctaca 240 
ccagtcaaca taattggaag aaatctgtta actcagcttg gatgcacact aaatttt 297 

<210> 68 
<211> 1965 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Inactivated Protease Mutated Reverse 
Transcriptase Optimized 



<400> 68 

ccccagatca ccctgtggca gcgccccctg gtgagcatca aggtggaggg ccagatcaag 60 
gaggccctgc tggccaccgg cgccgacgac accgtgctgg aggagatcga cctgcccggc 120 
aagtggaagc ccaagatgat cggcggcatc ggcggcttca tcaaggtgcg ccagtacgac 180 
cagatcctga tcgagatctg cggcaagaag gccatcggca ccgtgctggt gggccccacc 240 
cccgtgaaca tcatcggccg caacctgctg acccagctgg gctgcaccct gaacttcccc 300 
atcagcccca tcgagaccgt gcccgtgaag ctgaagcccg gcatggacgg ccccaaggtg 360 
aagcagtggc ccctgaccga ggagaagatc aaggccctga ccgccatctg cgaggagatg 420 
gagaaggagg gcaagatcac caagatcggc cccgacaacc cctacaacac ccccgtgttc 480 
gccatcaaga agaaggacag caccaagtgg cgcaagctgg tggacttccg cgagctgaac 540 
aagcgcaccc aggacttctg ggaggtgcag ctgggcatcc cccaccccgc cggcctgaag 600 
aagaagaaga gcgtgaccgt gctggacgtg ggcgacgcct acttcagcgt gcccctggac 660 
gagagcttcc gcaagtacac cgccttcacc atccccagca tcaacaacga gacccccggc 720 
atccgctacc agtacaacgt gctgccccag ggctggaagg gcagccccgc catcttccag 780 
agcagcatga ccaagatcct ggagcccttc cgcgccaaga accccgacat cgtgatctac 840 
caggcccccc tgtacgtggg cagcgacctg gagatcggcc agcaccgcgc caagatcgag 900 
gagctgcgcg agcacctgct gaagtggggc ttcaccaccc ccgacaagaa gcaccagaag 960 
gagcccccct tcctgcccat cgagctgcac cccgacaagt ggaccgtgca gcccatcctg 1020 
ctgcccgaga aggacagctg gaccgtgaac gacatccaga agctggtggg caagctgaac 1080 
tgggccagcc agatctaccc cggcatcaag gtgcgccagc tgtgcaagct gctgcgcggc 1140 
gccaaggccc tgaccgacat cgtgcccctg accgaggagg ccgagctgga gctggccgag 1200 
aaccgcgaga tcctgcgcga gcccgtgcac ggcgtgtact acgaccccag caaggacctg 1260 
atcgccgaga tccagaagca gggccacgag cagtggacct accagatcta ccaggagccc 1320 
ttcaagaacc tgaagaccgg caagtacgcc aagatgcgca ccacccacac caacgacgtg 1380 
aagcagctga ccgaggccgt gcagaagatc gccatggaga gcatcgtgat ctggggcaag 1440 
acccccaagt tccgcctgcc catccagaag gagacctggg agacctggtg gaccgactac 1500 
tggcaggcca cctggatccc cgagtgggag ttcgtgaaca ccccccccct ggtgaagctg 1560 
tggtaccagc tggagaagga ccccatcgcc ggcgtggaga ccttctacgt ggacggcgcc 1620 
accaaccgcg aggccaagat cggcaaggcc ggctacgtga ccgaccgcgg ccgccagaag 1680 
atcgtgaccc tgaccaacac caccaaccag aagaccgagc tgcaggccat ccagctggcc 1740 
ctgcaggaca gcggcagcga ggtgaacatc gtgaccgaca gccagtacgc cctgggcatc 1800 
atccaggccc agcccgacaa gagcgacagc gagatcttca accagatcat cgagcagctg 1860 
atcaacaagg agcgcatcta cctgagctgg gtgcccgccc acaagggcat cggcggcaac 1920 
gagcaggtgg acaagctggt gagcaagggc atccgcaagg tgctg 1965 

<210> 69 
<211> 1965 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Inactivated Protease Mutated Reverse Transcriptase 
Wild Type 



<400> 69 

cctcaaatca ctctttggca gcgacccctt gtctcaataa aagtagaggg ccagataaag 60 
gaggctctct tagccacagg agcagatgat acagtattag aagaaataga tttgccaggg 120 
aaatggaaac caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat 180 
caaatactta tagaaatttg tggaaaaaag gctataggta cagtattagt agggcctaca 240 
ccagtcaaca taattggaag aaatctgtta actcagcttg gatgcacact aaattttcca 300 
attagtccta ttgaaactgt accagtaaaa ttaaaaccag gaatggatgg cccaaaggtc 360 
aaacaatggc cattgacaga agaaaaaata aaagcattaa cagcaatttg tgaggaaatg 420 
gagaaggaag gaaaaattac aaaaattggg cctgataatc catataacac tccagtattt 480 
gccataaaaa agaaggacag tactaagtgg agaaaattag tagatttcag ggaactcaat 540 
aaaagaactc aagacttttg ggaagttcaa ttaggaatac cacacccagc aggattaaaa 600 
aagaaaaaat cagtgacagt gctagatgtg ggggatgcat atttttcagt tcctttagat 660 
gaaagcttca ggaaatatac tgcattcacc atacctagta taaacaatga aacaccaggg 720 
attagatatc aatataatgt gctgccacag ggatggaaag gatcaccagc aatattccag 780 
agtagcatga caaaaatctt agagcccttc agagcaaaaa atccagacat agttatctat 840 
caagccccgt tgtatgtagg atctgactta gaaatagggc aacatagagc aaaaatagaa 900 
gagttaaggg aacatttatt gaaatgggga tttacaacac cagacaagaa acatcaaaaa 960 
gaacccccat ttcttcccat cgaactccat cctgacaaat ggacagtaca acctatactg 1020 
ctgccagaaa aggatagttg gactgtcaat gatatacaga agttagtggg aaaattaaac 1080 
tgggcaagtc agatttaccc agggattaaa gtaaggcaac tctgtaaact cctcaggggg 1140 
gccaaagcac taacagacat agtaccacta actgaagaag cagaattaga attggcagag 1200 
aacagggaaa ttttaagaga accagtacat ggagtatatt atgatccatc aaaagacttg 1260 
atagctgaaa tacagaaaca ggggcatgaa caatggacat atcaaattta tcaagaacca 1320 
tttaaaaatc tgaaaacagg gaagtatgca aaaatgagga ctacccacac taatgatgta 1380 
aaacagttaa cagaggcagt gcaaaaaata gccatggaaa gcatagtaat atggggaaag 1440 
actcctaaat ttagactacc catccaaaaa gaaacatggg agacatggtg gacagactat 1500 
tggcaagcca cctggatccc tgagtgggag tttgttaata cccctcccct agtaaaatta 1560 
tggtaccaac tagaaaaaga tcccatagca ggagtagaaa ctttctatgt agatggagca 1620 
actaataggg aagctaaaat aggaaaagca gggtatgtta ctgacagagg aaggcagaaa 1680 
attgttactc taactaacac aacaaatcag aagactgagt tacaagcaat tcagctagct 1740 
ctgcaggatt caggatcaga agtaaacata gtaacagact cacagtatgc attaggaatc 1800 
attcaagcac aaccagataa gagtgactca gagatattta accaaataat agaacagtta 1860 
ataaacaagg aaagaatcta cctgtcatgg gtaccagcac ataaaggaat tgggggaaat 1920 
gaacaagtag ataaattagt aagtaaggga attaggaaag tgttg 1965 

<210> 70 
<211> 1977 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Protease and Reverse Transcriptase Optimized 

<400> 70 

ccccagatca ccctgtggca gcgccccctg gtgagcatca aggtggaggg ccagatcaag 60 
gaggccctgc tggacaccgg cgccgacgac accgtgctgg aggagatcga cctgcccggc 120 
aagtggaagc ccaagatgat cggcggcatc ggcggcttca tcaaggtgcg ccagtacgac 180 
cagatcctga tcgagatctg cggcaagaag gccatcggca ccgtgctggt gggccccacc 240 
cccgtgaaca tcatcggccg caacctgctg acccagctgg gctgcaccct gaacttcccc 300 
atcagcccca tcgagaccgt gcccgtgaag ctgaagcccg gcatggacgg ccccaaggtg 360 
aagcagtggc ccctgaccga ggagaagatc aaggccctga ccgccatctg cgaggagatg 420 
gagaaggagg gcaagatcac caagatcggc cccgacaacc cctacaacac ccccgtgttc 480 
gccatcaaga agaaggacag caccaagtgg cgcaagctgg tggacttccg cgagctgaac 540 
aagcgcaccc aggacttctg ggaggtgcag ctgggcatcc cccaccccgc cggcctgaag 600 
aagaagaaga gcgtgaccgt gctggacgtg ggcgacgcct acttcagcgt gcccctggac 660 
gagagcttcc gcaagtacac cgccttcacc atccccagca tcaacaacga gacccccggc 720 
atccgctacc agtacaacgt gctgccccag ggctggaagg gcagccccgc catcttccag 780 
agcagcatga ccaagatcct ggagcccttc cgcgccaaga accccgacat cgtgatctac 840 
cagtacatgg acgacctgta cgtgggcagc gacctggaga tcggccagca ccgcgccaag 900 
atcgaggagc tgcgcgagca cctgctgaag tggggcttca ccacccccga caagaagcac 960 
cagaaggagc cccccttcct gtggatgggc tacgagctgc accccgacaa gtggaccgtg 1020 
cagcccatcc tgctgcccga gaaggacagc tggaccgtga acgacatcca gaagctggtg 1080 
ggcaagctga actgggccag ccagatctac cccggcatca aggtgcgcca gctgtgcaag 1140 
ctgctgcgcg gcgccaaggc cctgaccgac atcgtgcccc tgaccgagga ggccgagctg 1200 
gagctggccg agaaccgcga gatcctgcgc gagcccgtgc acggcgtgta ctacgacccc 1260 
agcaaggacc tgatcgccga gatccagaag cagggccacg agcagtggac ctaccagatc 1320 
taccaggagc ccttcaagaa cctgaagacc ggcaagtacg ccaagatgcg caccacccac 1380 
accaacgacg tgaagcagct gaccgaggcc gtgcagaaga tcgccatgga gagcatcgtg 1440 
atctggggca agacccccaa gttccgcctg cccatccaga aggagacctg ggagacctgg 1500 
tggaccgact actggcaggc cacctggatc cccgagtggg agttcgtgaa cacccccccc 1560 
ctggtgaagc tgtggtacca gctggagaag gaccccatcg ccggcgtgga gaccttctac 1620 
gtggacggcg ccaccaaccg cgaggccaag atcggcaagg ccggctacgt gaccgaccgc 1680 
ggccgccaga agatcgtgac cctgaccaac accaccaacc agaagaccga gctgcaggcc 1740 
atccagctgg ccctgcagga cagcggcagc gaggtgaaca tcgtgaccga cagccagtac 1800 
gccctgggca tcatccaggc ccagcccgac aagagcgaca gcgagatctt caaccagatc 1860 
atcgagcagc tgatcaacaa ggagcgcatc tacctgagct gggtgcccgc ccacaagggc 1920 
atcggcggca acgagcaggt ggacaagctg gtgagcaagg gcatccgcaa ggtgctg 1977 

<210> 71 
<211> 1977 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Protease and Reverse Transcriptase Wild Type 

<400> 71 

cctcaaatca ctctttggca gcgacccctt gtctcaataa aagtagaggg ccagataaag 60 
gaggctctct tagacacagg agcagatgat acagtattag aagaaataga tttgccaggg 120 
aaatggaaac caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat 180 
caaatactta tagaaatttg tggaaaaaag gctataggta cagtattagt agggcctaca 240 
ccagtcaaca taattggaag aaatctgtta actcagcttg gatgcacact aaattttcca 300 
attagtccta ttgaaactgt accagtaaaa ttaaaaccag gaatggatgg cccaaaggtc 360 
aaacaatggc cattgacaga agaaaaaata aaagcattaa cagcaatttg tgaggaaatg 420 
gagaaggaag gaaaaattac aaaaattggg cctgataatc catataacac tccagtattt 480 
gccataaaaa agaaggacag tactaagtgg agaaaattag tagatttcag ggaactcaat 540 
aaaagaactc aagacttttg ggaagttcaa ttaggaatac cacacccagc aggattaaaa 600 
aagaaaaaat cagtgacagt gctagatgtg ggggatgcat atttttcagt tcctttagat 660 
gaaagcttca ggaaatatac tgcattcacc atacctagta taaacaatga aacaccaggg 720 
attagatatc aatataatgt gctgccacag ggatggaaag gatcaccagc aatattccag 780 
agtagcatga caaaaatctt agagcccttc agagcaaaaa atccagacat agttatctat 840 
caatatatgg atgacttgta tgtaggatct gacttagaaa tagggcaaca tagagcaaaa 900 
atagaagagt taagggaaca tttattgaaa tggggattta caacaccaga caagaaacat 960 
caaaaagaac ccccatttct ttggatgggg tatgaactcc atcctgacaa atggacagta 1020 
caacctatac tgctgccaga aaaggatagt tggactgtca atgatataca gaagttagtg 1080 
ggaaaattaa actgggcaag tcagatttac ccagggatta aagtaaggca actctgtaaa 1140 
ctcctcaggg gggccaaagc actaacagac atagtaccac taactgaaga agcagaatta 1200 
gaattggcag agaacaggga aattttaaga gaaccagtac atggagtata ttatgatcca 1260 
tcaaaagact tgatagctga aatacagaaa caggggcatg aacaatggac atatcaaatt 1320 
tatcaagaac catttaaaaa tctgaaaaca gggaagtatg caaaaatgag gactacccac 1380 
actaatgatg taaaacagtt aacagaggca gtgcaaaaaa tagccatgga aagcatagta 1440 
atatggggaa agactcctaa atttagacta cccatccaaa aagaaacatg ggagacatgg 1500 
tggacagact attggcaagc cacctggatc cctgagtggg agtttgttaa tacccctccc 1560 
ctagtaaaat tatggtacca actagaaaaa gatcccatag caggagtaga aactttctat 1620 
gtagatggag caactaatag ggaagctaaa ataggaaaag cagggtatgt tactgacaga 1680 
ggaaggcaga aaattgttac tctaactaac acaacaaatc agaagactga gttacaagca 1740 
attcagctag ctctgcagga ttcaggatca gaagtaaaca tagtaacaga ctcacagtat 1800 
gcattaggaa tcattcaagc acaaccagat aagagtgact cagagatatt taaccaaata 1860 
atagaacagt taataaacaa ggaaagaatc tacctgtcat gggtaccagc acataaagga 1920 
attgggggaa atgaacaagt agataaatta gtaagtaagg gaattaggaa agtgttg 1977 

<210> 72 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
RevExonl Optimized 

<400> 72 

atggccggcc gcagcggcga cagcgacgag gccctgctgc aggtggtgaa gatcatcaag 60 
atcctgtacc agagc 75 

<210> 73 
<211> 76 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
RevExonl Wild Type 

<400> 73 

atggcaggaa gaagcggaga cagcgacgaa gcgctcctcc aagtggtgaa gatcatcaaa 60 
atcctctatc aaagca 76 

<210> 74 
<211> 246 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
RevExon2 Optimized 



<400> 74 

ccctacccca agcccgaggg cacccgccag gcccgccgca accgccgccg ccgctggcgc 60 
gcccgccagc gccagatcca caccatcggc gagcgcatcc tggtggcctg cctgggccgc 120 
agcgccgagc ccgtgcccct gcagctgccc cccctggagc gcctgcacat caactgcagc 180 
gagggcagcg gcaccagcgg cacccagcag agccagggca ccaccgaggg cgtgggcgac 240 
ccctaa 246 

<210> 75 
<211> 248 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
RevExon2 Wild Type 



<400> 75 

acccttaccc caagcccgag gggactcgac aggctcggag gaatcgaaga agaaggtgga 60 
gagcaagaca gagacagatc catacgattg gtgagcggat tcttgtcgct tgcctgggac 120 
gatctgcgga gcctgtgcct cttcagctac caccgcttga gagacttcat attaattgca 180 
gtgagggcag tggaacttct gggacacagc agtctcaggg gactacagag ggggtgggag 240 
atccttaa 248 

<210> 76 
<211> 1680 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Reverse Transcriptase Optimized 

<:400> 76 

cccatcagcc ccatcgagac cgtgcccgtg aagctgaagc ccggcatgga cggccccaag 60 

gtgaagcagt ggcccctgac cgaggagaag atcaaggccc tgaccgccat ctgcgaggag 120 

atggagaagg agggcaagat caccaagatc ggccccgaca acccctacaa cacccccgtg 180 

ttcgccatca agaagaagga cagcaccaag tggcgcaagc tggtggactt ccgcgagctg 240 

aacaagcgca cccaggactt ctgggaggtg cagctgggca tcccccaccc cgccggcctg 300 

aagaagaaga agagcgtgac cgtgctggac gtgggcgacg cctacttcag cgtgcccctg 360 

gacgagagct tccgcaagta caccgccttc accatcccca gcatcaacaa cgagaccccc 420 

ggcatccgct accagtacaa cgtgctgccc cagggctgga agggcagccc cgccatcttc 480 

cagagcagca tgaccaagat cctggagccc ttccgcgcca agaaccccga catcgtgatc 540 

taccagtaca tggacgacct gtacgtgggc agcgacctgg agatcggcca gcaccgcgcc 600 

aagatcgagg agctgcgcga gcacctgctg aagtggggct tcaccacccc cgacaagaag 660 

caccagaagg agcccccctt cctgtggatg ggctacgagc tgcaccccga caagtggacc 720 

gtgcagccca tcctgctgcc cgagaaggac agctggaccg tgaacgacat ccagaagctg 780 

gtgggcaagc tgaactgggc cagccagatc taccccggca tcaaggtgcg ccagctgtgc 840 

aagctgctgc gcggcgccaa ggccctgacc gacatcgtgc ccctgaccga ggaggccgag 900 

ctggagctgg ccgagaaccg cgagatcctg cgcgagcccg tgcacggcgt gtactacgac 960 

cccagcaagg acctgatcgc cgagatccag aagcagggcc acgagcagtg gacctaccag 1020 

atctaccagg agcccttcaa gaacctgaag accggcaagt acgccaagat gcgcaccacc 1080 

cacaccaacg acgtgaagca gctgaccgag gccgtgcaga agatcgccat ggagagcatc 1140 

gtgatctggg gcaagacccc caagttccgc ctgcccatcc agaaggagac ctgggagacc 1200 

tggtggaccg actactggca ggccacctgg atccccgagt gggagttcgt gaacaccccc 1260 

cccctggtga agctgtggta ccagctggag aaggacccca tcgccggcgt ggagaccttc 1320 

tacgtggacg gcgccaccaa ccgcgaggcc aagatcggca aggccggcta cgtgaccgac 1380 

cgcggccgcc agaagatcgt gaccctgacc aacaccacca accagaagac cgagctgcag 1440 

gccatccagc tggccctgca ggacagcggc agcgaggtga acatcgtgac cgacagccag 1500 

tacgccctgg gcatcatcca ggcccagccc gacaagagcg acagcgagat cttcaaccag 1560 

atcatcgagc agctgatcaa caaggagcgc atctacctga gctgggtgcc cgcccacaag 1620

ggcatcggcg gcaacgagca ggtggacaag ctggtgagca agggcatccg caaggtgctg 1680 

<210> 77 
<211> 1680 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Reverse Transcriptase Wild Type

<400> 77 

ccaattagtc ctattgaaac tgtaccagta aaattaaaac caggaatgga tggcccaaag 60 
gtcaaacaat ggccattgac agaagaaaaa ataaaagcat taacagcaat ttgtgaggaa 120 
atggagaagg aaggaaaaat tacaaaaatt gggcctgata atccatataa cactccagta 180 
tttgccataa aaaagaagga cagtactaag tggagaaaat tagtagattt cagggaactc 240 
aataaaagaa ctcaagactt ttgggaagtt caattaggaa taccacaccc agcaggatta 300 
aaaaagaaaa aatcagtgac agtgctagat gtgggggatg catatttttc agttccttta 360 
gatgaaagct tcaggaaata tactgcattc accataccta gtataaacaa tgaaacacca 420 
gggattagat atcaatataa tgtgctgcca cagggatgga aaggatcacc agcaatattc 480 
cagagtagca tgacaaaaat cttagagccc ttcagagcaa aaaatccaga catagttatc 540 
tatcaatata tggatgactt gtatgtagga tctgacttag aaatagggca acatagagca 600 
aaaatagaag agttaaggga acatttattg aaatggggat ttacaacacc agacaagaaa 660 
catcaaaaag aacccccatt tctttggatg gggtatgaac tccatcctga caaatggaca 720 
gtacaaccta tactgctgcc agaaaaggat agttggactg tcaatgatat acagaagtta 780 
gtgggaaaat taaactgggc aagtcagatt tacccaggga ttaaagtaag gcaactctgt 840 
aaactcctca ggggggccaa agcactaaca gacatagtac cactaactga agaagcagaa 900 
ttagaattgg cagagaacag ggaaatttta agagaaccag tacatggagt atattatgat 960 
ccatcaaaag acttgatagc tgaaatacag aaacaggggc atgaacaatg gacatatcaa 1020 
atttatcaag aaccatttaa aaatctgaaa acagggaagt atgcaaaaat gaggactacc 1080 
cacactaatg atgtaaaaca gttaacagag gcagtgcaaa aaatagccat ggaaagcata 1140 
gtaatatggg gaaagactcc taaatttaga ctacccatcc aaaaagaaac atgggagaca 1200 
tggtggacag actattggca agccacctgg atccctgagt gggagtttgt taatacccct 1260 
cccctagtaa aattatggta ccaactagaa aaagatccca tagcaggagt agaaactttc 1320 
tatgtagatg gagcaactaa tagggaagct aaaataggaa aagcagggta tgttactgac 1380 
agaggaaggc agaaaattgt tactctaact aacacaacaa atcagaagac tgagttacaa 1440 
gcaattcagc tagctctgca ggattcagga tcagaagtaa acatagtaac agactcacag 1500 
tatgcattag gaatcattca agcacaacca gataagagtg actcagagat atttaaccaa 1560 
ataatagaac agttaataaa caaggaaaga atctacctgt catgggtacc agcacataaa 1620 
ggaattgggg gaaatgaaca agtagataaa ttagtaagta agggaattag gaaagtgttg 1680 

<210> 78 
<211> 1668 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Mutated Reverse Transcriptase Optimized 

<400> 78 

cccatcagcc ccatcgagac cgtgcccgtg aagctgaagc ccggcatgga cggccccaag 60 
gtgaagcagt ggcccctgac cgaggagaag atcaaggccc tgaccgccat ctgcgaggag 120 
atggagaagg agggcaagat caccaagatc ggccccgaca acccctacaa cacccccgtg 180 
ttcgccatca agaagaagga cagcaccaag tggcgcaagc tggtggactt ccgcgagctg 240 
aacaagcgca cccaggactt ctgggaggtg cagctgggca tcccccaccc cgccggcctg 300 
aagaagaaga agagcgtgac cgtgctggac gtgggcgacg cctacttcag cgtgcccctg 360 
gacgagagct tccgcaagta caccgccttc accatcccca gcatcaacaa cgagaccccc 420 
ggcatccgct accagtacaa cgtgctgccc cagggctgga agggcagccc cgccatcttc 480 
cagagcagca tgaccaagat cctggagccc ttccgcgcca agaaccccga catcgtgatc 540 
taccaggccc ccctgtacgt gggcagcgac ctggagatcg gccagcaccg cgccaagatc 600 
gaggagctgc gcgagcacct gctgaagtgg ggcttcacca cccccgacaa gaagcaccag 660

aaggagcccc ccttcctgcc catcgagctg caccccgaca agtggaccgt gcagcccatc 720 

ctgctgcccg agaaggacag ctggaccgtg aacgacatcc agaagctggt gggcaagctg 780 

aactgggcca gccagatcta ccccggcatc aaggtgcgcc agctgtgcaa gctgctgcgc 840 

ggcgccaagg ccctgaccga catcgtgccc ctgaccgagg aggccgagct ggagctggcc 900 

gagaaccgcg agatcctgcg cgagcccgtg cacggcgtgt actacgaccc cagcaaggac 960 

ctgatcgccg agatccagaa gcagggccac gagcagtgga cctaccagat ctaccaggag 1020 

cccttcaaga acctgaagac cggcaagtac gccaagatgc gcaccaccca caccaacgac 1080

gtgaagcagc tgaccgaggc cgtgcagaag atcgccatgg agagcatcgt gatctggggc 1140 

aagaccccca agttccgcct gcccatccag aaggagacct gggagacctg gtggaccgac 1200 

tactggcagg ccacctggat ccccgagtgg gagttcgtga acaccccccc cctggtgaag 1260 

ctgtggtacc agctggagaa ggaccccatc gccggcgtgg agaccttcta cgtggacggc 1320 

gccaccaacc gcgaggccaa gatcggcaag gccggctacg tgaccgaccg cggccgccag 1380 

aagatcgtga ccctgaccaa caccaccaac cagaagaccg agctgcaggc catccagctg 1440 

gccctgcagg acagcggcag cgaggtgaac atcgtgaccg acagccagta cgccctgggc 1500 

atcatccagg cccagcccga caagagcgac agcgagatct tcaaccagat catcgagcag 1560 

ctgatcaaca aggagcgcat ctacctgagc tgggtgcccg cccacaaggg catcggcggc 1620 

aacgagcagg tggacaagct ggtgagcaag ggcatccgca aggtgctg 1668 

<210> 79 
<211> 1668 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Mutated Reverse Transcriptase Wild Type 

<400> 79 

ccaattagtc ctattgaaac tgtaccagta aaattaaaac caggaatgga tggcccaaag 60 
gtcaaacaat ggccattgac agaagaaaaa ataaaagcat taacagcaat ttgtgaggaa 120 
atggagaagg aaggaaaaat tacaaaaatt gggcctgata atccatataa cactccagta 180 
tttgccataa aaaagaagga cagtactaag tggagaaaat tagtagattt cagggaactc 240 
aataaaagaa ctcaagactt ttgggaagtt caattaggaa taccacaccc agcaggatta 300 
aaaaagaaaa aatcagtgac agtgctagat gtgggggatg catatttttc agttccttta 360 
gatgaaagct tcaggaaata tactgcattc accataccta gtataaacaa tgaaacacca 420 
gggattagat atcaatataa tgtgctgcca cagggatgga aaggatcacc agcaatattc 480 
cagagtagca tgacaaaaat cttagagccc ttcagagcaa aaaatccaga catagttatc 540 
tatcaagccc cgttgtatgt aggatctgac ttagaaatag ggcaacatag agcaaaaata 600 
gaagagttaa gggaacattt attgaaatgg ggatttacaa caccagacaa gaaacatcaa 660 
aaagaacccc catttcttcc catcgaactc catcctgaca aatggacagt acaacctata 720 
ctgctgccag aaaaggatag ttggactgtc aatgatatac agaagttagt gggaaaatta 780 
aactgggcaa gtcagattta cccagggatt aaagtaaggc aactctgtaa actcctcagg 840 
ggggccaaag cactaacaga catagtacca ctaactgaag aagcagaatt agaattggca 900 
gagaacaggg aaattttaag agaaccagta catggagtat attatgatcc atcaaaagac 960 
ttgatagctg aaatacagaa acaggggcat gaacaatgga catatcaaat ttatcaagaa 1020 
ccatttaaaa atctgaaaac agggaagtat gcaaaaatga ggactaccca cactaatgat 1080 
gtaaaacagt taacagaggc agtgcaaaaa atagccatgg aaagcatagt aatatgggga 1140 
aagactccta aatttagact acccatccaa aaagaaacat gggagacatg gtggacagac 1200 
tattggcaag ccacctggat ccctgagtgg gagtttgtta atacccctcc cctagtaaaa 1260 
ttatggtacc aactagaaaa agatcccata gcaggagtag aaactttcta tgtagatgga 1320 
gcaactaata gggaagctaa aataggaaaa gcagggtatg ttactgacag aggaaggcag 1380 
aaaattgtta ctctaactaa cacaacaaat cagaagactg agttacaagc aattcagcta 1440 
gctctgcagg attcaggatc agaagtaaac atagtaacag actcacagta tgcattagga 1500 
atcattcaag cacaaccaga taagagtgac tcagagatat ttaaccaaat aatagaacag 1560 
ttaataaaca aggaaagaat ctacctgtca tgggtaccag cacataaagg aattggggga 1620 
aatgaacaag tagataaatt agtaagtaag ggaattagga aagtgttg 1668 

<210> 80
<211> 216
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: HIV Type C
TatC22Exon1 Optimized

<400> 80

atggagcccg tggaccccaa gctgaagccc tggaaccacc ccggcagcca gcccaagacc 60
gccggcaaca actgcttctg caagcactgc agctaccact gcctggtgtg cttccagacc 120
aagggcctgg gcatcagcta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180
agcggcgagg accaccagaa ccccctgagc aagcag 216

<210> 81
<211> 216
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: HIV Type C
TatExon1 Optimized

<400> 81

atggagcccg tggaccccaa gctgaagccc tggaaccacc ccggcagcca gcccaagacc 60
gcctgcaaca actgcttctg caagcactgc agctaccact gcctggtgtg cttccagacc 120
aagggcctgg gcatcagcta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180
agcggcgagg accaccagaa ccccctgagc aagcag 216

<210> 82
<211> 216
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: HIV Type C
TatExon1 Wild Type

<400> 82

atggagccag tagatcctaa actaaagccc tggaaccatc caggaagcca acctaaaaca 60
gcttgtaata attgcttttg caaacactgt agctatcatt gtctagtttg ctttcagaca 120
aaaggtttag gcatttccta tggcaggaag aagcggagac agcgacgaag cgctcctcca 180
agtggtgaag atcatcaaaa tcctctatca aagcag 216

<210> 83
<211> 93
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: HIV Type C
TatExon2 Optimized

<400> 83

cccctgcccc aggcccgcgg cgacagcacc ggcagcgagg agagcaagaa gaaggtggag 60
agcaagaccg agaccgaccc ctacgactgg tga 93

<210> 84 
<211> 93 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
TatExon2 Wild Type

<400> 84

cccttacccc aagcccgagg ggactcgaca ggctcggagg aatcgaagaa gaaggtggag 60 
agcaagacag agacagatcc atacgattgg tga 93 

<210> 85 
<211> 579 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: HIV Type C 
Vif Optimized 



<400> 85

atggagaacc gctggcaggt gctgatcgtg tggcaggtgg accgcatgaa gatccgcgcc 60 
tggaacagcc tggtgaagca ccacatgtac atcagccgcc gcgccagcgg ctgggtgtac 120 
cgccaccact tcgagagccg ccaccccaag gtgagcagcg aggtgcacat ccccctgggc 180 
gacgcccgcc tggtgatcaa gacctactgg ggcctgcaga ccggcgagcg cgactggcac 240 
ctgggccacg gcgtgagcat cgagtggcgc ctgcgcgagt acagcaccca ggtggacccc 300 
gacctggccg accagctgat ccacatgcac tacttcgact gcttcaccga gagcgccatc 360 
cgccaggcca tcctgggcca catcgtgttc ccccgctgcg actaccaggc cggccacaag 420 
aaggtgggca gcctgcagta cctggccctg accgccctga tcaagcccaa gaagcgcaag 480 
ccccccctgc ccagcgtgcg caagctggtg gaggaccgct ggaacgaccc ccagaagacc 540 
cgcggccgcc gcggcaacca caccatgaac ggccactag 579 

<210> 86 
<211> 579 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C
Vif Wild Type

<400> 86

atggaaaaca gatggcaggt gctgattgtg tggcaggtgg acaggatgaa gattagagca 60
tggaatagtt tagtaaagca ccatatgtat atatcaagga gagctagtgg atgggtctac 120
agacatcatt ttgaaagcag acatccaaaa gtaagttcag aagtacatat cccattaggg 180
gatgctagat tagtaataaa aacatattgg ggtttgcaga caggagaaag agattggcat 240
ttgggtcatg gagtctccat agaatggaga ctgagagaat acagcacaca agtagaccct 300
gacctggcag accagctaat tcacatgcat tattttgatt gttttacaga atctgccata 360
agacaagcca tattaggaca catagttttt cctaggtgtg actatcaagc aggacataag 420
aaggtaggat ctctgcaata cttggcactg acagcattga taaaaccaaa aaagagaaag 480
ccacctctgc ctagtgttag aaaattagta gaggatagat ggaacgaccc ccagaagacc 540
aggggccgca gagggaacca tacaatgaat ggacactag 579

<210> 87
<211> 288
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Vpr Optimized 

<400> 87 

atggagcgcc cccccgagga ccagggcccc cagcgcgagc cctacaacga gtggaccctg 60 
gagatcctgg aggagctgaa gcaggaggcc gtgcgccact tcccccgccc ctggctgcac 120 
agcctgggcc agtacatcta cgagacctac ggcgacacct ggaccggcgt ggaggccatc 180 
atccgcgtgc tgcagcagct gctgttcatc cacttccgca tcggctgcca gcacagccgc 240 
atcggcatcc tgcgccagcg ccgcgcccgc aacggcgcca gccgcagc 288 

<210> 88 
<211> 288 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Vpr Wild Type 

<400> 88 

atggaacgac ccccagaaga ccaggggccg cagagggaac catacaatga atggacacta 60 
gagattctag aagaactcaa gcaggaagct gtcagacact ttcctagacc atggctccat 120 
agcttaggac aatatatcta tgaaacctat ggggatactt ggacgggagt tgaagctata 180 
ataagagtac tgcaacaact actgttcatt catttcagaa ttggatgcca acatagcaga 240 
ataggcatct tgcgacagag aagagcaaga aatggagcca gtagatcc 288 

<210> 89 
<211> 267 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Vpu Optimized 

<400> 89 

atggtgagcc tgagcctgtt caagggcgtg gactaccgcc tgggcgtggg cgccctgatc 60 

gtggccctga tcatcgccat catcgtgtgg accatcgcct acatcgagta ccgcaagctg 120 

gtgcgccaga agaagatcga ctggctgatc aagcgcatcc gcgagcgcgc cgaggacagc 180 

ggcaacgaga gcgacggcga caccgaggag ctgagcacca tggtggacat gggccacctg 240 
cgcctgctgg acgccaacga cctgtaa 267 

<210> 90 
<211> 267 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
Vpu Wild Type 

<400> 90 
atggtaagtt taagtttatt taaaggagta gattatagat taggagtagg agcattgata 60
gtagcactaa tcatagcaat aatagtgtgg accatagcat atatagaata taggaaattg 120 
gtaagacaaa agaaaataga ctggttaatc aaaagaatta gggaaagagc agaagacagt 180 
ggcaatgaga gtgatgggga cacagaagaa ttgtcaacaa tggtggatat ggggcatctt 240 
aggcttctgg atgctaatga tttgtaa 267 

<210> 91 
<211> 321 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
RevExon 1 and 2 Optimized

<400> 91

atggccggcc gcagcggcga cagcgacgag gccctgctgc aggtggtgaa gatcatcaag 60
atcctgtacc agagccccta ccccaagccc gagggcaccc gccaggcccg ccgcaaccgc 120
cgccgccgct ggcgcgcccg ccagcgccag atccacacca tcggcgagcg catcctggtg 180
gcctgcctgg gccgcagcgc cgagcccgtg cccctgcagc tgccccccct ggagcgcctg 240
cacatcaact gcagcgaggg cagcggcacc agcggcaccc agcagagcca gggcaccacc 300
gagggcgtgg gcgaccccta a 321

<210> 92 
<211> 324 
<212> DNA 
<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: HIV Type C 
RevExon 1 and 2 Wild Type 

<400> 92 

atggcaggaa gaagcggaga cagcgacgaa gcgctcctcc aagtggtgaa gatcatcaaa 60 
atcctctatc aaagcaaccc ttaccccaag cccgagggga ctcgacaggc tcggaggaat 120 
cgaagaagaa ggtggagagc aagacagaga cagatccata cgattggtga gcggattctt 180 
gtcgcttgcc tgggacgatc tgcggagcct gtgcctcttc agctaccacc gcttgagaga 240 
cttcatatta attgcagtga gggcagtgga acttctggga cacagcagtc tcaggggact 300 
acagaggggg tgggagatcc ttaa 324 

<210> 93 
<211> 309 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV Type C 
TatC22 Exon 1 and 2 Optimized 

<400> 93 

atggagcccg tggaccccaa gctgaagccc tggaaccacc ccggcagcca gcccaagacc 60 
gccggcaaca actgcttctg caagcactgc agctaccact gcctggtgtg cttccagacc 120 
aagggcctgg gcatcagcta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180
agcggcgagg accaccagaa ccccctgagc aagcagcccc tgccccaggc ccgcggcgac 240
agcaccggca gcgaggagag caagaagaag gtggagagca agaccgagac cgacccctac 300 
gactggtga 309 

<210> 94
<211> 309
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: HIV Type C
Tat Exon 1 and 2 Optimized

<400> 94

atggagcccg tggaccccaa gctgaagccc tggaaccacc ccggcagcca gcccaagacc 60
gcctgcaaca actgcttctg caagcactgc agctaccact gcctggtgtg cttccagacc 120
aagggcctgg gcatcagcta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180
agcggcgagg accaccagaa ccccctgagc aagcagcccc tgccccaggc ccgcggcgac 240
agcaccggca gcgaggagag caagaagaag gtggagagca agaccgagac cgacccctac 300
gactggtga 309

<210> 95
<211> 309
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: HIV Type C
Tat Exon 1 and 2 Wild Type

<400> 95

atggagccag tagatcctaa actaaagccc tggaaccatc caggaagcca acctaaaaca 60
gcttgtaata attgcttttg caaacactgt agctatcatt gtctagtttg ctttcagaca 120
aaaggtttag gcatttccta tggcaggaag aagcggagac agcgacgaag cgctcctcca 180
agtggtgaag atcatcaaaa tcctctatca aagcagccct taccccaagc ccgaggggac 240
tcgacaggct cggaggaatc gaagaagaag gtggagagca agacagagac agatccatac 300
gattggtga 309

<210> 96
<211> 624
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: HIV Type C
NefD125g Optimized Myristalization Modification

<400> 96

atggccggca agtggagcaa gcgcagcatc gtgggctggc ccgccgtgcg cgagcgcatg 60
cgccgcaccg agcccgccgc cgagggcgtg ggcgccgcca gccaggacct ggaccgccac 120
ggcgccctga ccagcagcaa cacccccgcc accaacgagg cctgcgcctg gctgcaggcc 180
caggaggagg acggcgacgt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240
acctacaaga gcgccgtgga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300
ctgatctaca gccgcaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360
ttcttccccg gctggcagaa ctacaccagc ggccccggcg tgcgcttccc cctgaccttc 420
ggctggtgct tcaagctggt gcccgtggac ccccgcgagg tgaaggaggc caacgagggc 480
gaggacaact gcctgctgca ccccatgagc cagcacggcg ccgaggacga ggaccgcgag 540
gtgctgaagt ggaagttcga cagcctgctg gcccaccgcc acatggcccg cgagctgcac 600
cccgagtact acaaggactg ctga 624

<210> 97
<211> 2565
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Envgp160_TV2_C_ZAopt
<400> 97 

atgcgcgccc gcggcatcct gaagaactac cgccactggt ggatctgggg catcctgggc 60 
ttctggatgc tgatgatgtg caacgtgaag ggcctgtggg tgaccgtgta ctacggcgtg 120 
cccgtgggcc gcgaggccaa gaccaccctg ttctgcgcca gcgacgccaa ggcctacgag 180 
aaggaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 240 
gaggtgatcc tgggcaacgt gaccgagaac ttcaacatgt ggaagaacga catggtggac 300 
cagatgcagg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 360 
acccccctgt gcgtgaccct gaactgcacc aacgccaccg tgaactacaa caacaccagc 420 
aaggacatga agaactgcag cttctacgtg accaccgagc tgcgcgacaa gaagaagaag 480 
gagaacgccc tgttctaccg cctggacatc gtgcccctga acaaccgcaa gaacggcaac 540 
atcaacaact accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 600 
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc ccccctgaag 660 
tgcaacaaca agaagttcaa cggcatcggc ccctgcgaca acgtgagcac cgtgcagtgc 720 
acccacggca tcaagcccgt ggtgagcacc cagctgctgc tgaacggcag cctggccgag 780
gaggagatca tcatccgcag cgagaacctg accaacaacg tgaagaccat catcgtgcac 840 
ctgaacgaga gcatcgagat caagtgcacc cgccccggca acaacacccg caagagcgtg 900 
cgcatcggcc ccggccaggc cttctacgcc accggcgaca tcatcggcga catccgccag 960 
gcccactgca acatcagcaa gaacgagtgg aacaccaccc tgcagcgcgt gagccagaag 1020 
ctgcaggagc tgttccccaa cagcaccggc atcaagttcg ccccccacag cggcggcgac 1080 
ctggagatca ccacccacag cttcaactgc ggcggcgagt tcttctactg caacaccacc 1140 
gacctgttca acagcaccta cagcaacggc acctgcacca acggcacctg catgagcaac 1200 
aacaccgagc gcatcaccct gcagtgccgc atcaagcaga tcatcaacat gtggcaggag 1260 
gtgggccgcg ccatgtacgc cccccccatc gccggcaaca tcacctgccg cagcaacatc 1320 
accggcctgc tgctgacccg cgacggcggc gacaacaaca ccgagaccga gaccttccgc 1380 
cccggcggcg gcgacatgcg cgacaactgg cgcagcgagc tgtacaagta caaggtggtg 1440 
gagatcaagc ccctgggcgt ggcccccacc gccgccaagc gccgcgtggt ggagcgcgag 1500 
aagcgcgccg tgggcatcgg cgccgtgttc ctgggcttcc tgggcgccgc cggcagcacc 1560 
atgggcgccg ccagcatcac cctgaccgtg caggcccgcc agctgctgag cggcatcgtg 1620 
cagcagcaga gcaacctgct gcgcgccatc gaggcccagc agcacatgct gcagctgacc 1680 
gtgtggggca tcaagcagct gcaggcccgc gtgctggcca tcgagcgcta cctgcaggac 1740 
cagcagctgc tgggcctgtg gggctgcagc ggcaagctga tctgcaccac caacgtgctg 1800 
tggaacagca gctggagcaa caagacccag agcgacatct gggacaacat gacctggatg 1860 
cagtgggacc gcgagatcag caactacacc aacaccatct accgcctgct ggaggacagc 1920 
cagagccagc aggagcgcaa cgagaaggac ctgctggccc tggaccgctg gaacaacctg 1980 
tggaactggt tcagcatcac caactggctg tggtacatca agatcttcat catgatcgtg 2040 
ggcggcctga tcggcctgcg catcatcttc gccgtgctga gcctggtgaa ccgcgtgcgc 2100 
cagggctaca gccccctgag cctgcagacc ctgatcccca acccccgcgg ccccgaccgc 2160 
ctgggcggca tcgaggagga gggcggcgag caggacagca gccgcagcat ccgcctggtg 2220 
agcggcttcc tgaccctggc ctgggacgac ctgcgcagcc tgtgcctgtt ctgctaccac 2280 
cgcctgcgcg acttcatcct gatcgtggtg cgcgccgtgg agctgctggg ccacagcagc 2340 
ctgcgcggcc tgcagcgcgg ctggggcacc ctgaagtacc tgggcagcct ggtgcagtac 2400 
tggggcctgg agctgaagaa gagcgccatc aacctgctgg acaccatcgc catcgccgtg 2460 
gccgagggca ccgaccgcat cctggagttc atccagaacc tgtgccgcgg catccgcaac 2520 
gtgccccgcc gcatccgcca gggcttcgag gccgccctgc agtaa 2565 

<210> 98 
<211> 2565 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Envgp160_TV2_C_ZAwt
<400> 98 

atgagagcga gggggatact gaagaattat cgacactggt ggatatgggg catcttaggc 60 
ttttggatgc taatgatgtg taatgtgaag ggcttgtggg tcacagtcta ctacggggta 120 
cctgtgggga gagaagcaaa aactactcta ttttgtgcat cagatgctaa agcatatgag 180 
aaagaagtgc ataatgtctg ggctacacat gcctgtgtac ccacagaccc caacccacaa 240 
gaagtgattt tgggcaatgt aacagaaaat tttaacatgt ggaaaaatga catggtggat 300 
cagatgcagg aagatataat cagtttatgg gatcaaagcc ttaagccatg tgtaaaattg 360 
accccactct gtgtcacttt aaactgtaca aatgcaactg ttaactacaa taatacctct 420 
aaagacatga aaaattgctc tttctatgta accacagaat taagagataa gaaaaagaaa 480 
gaaaatgcac ttttttatag acttgatata gtaccactta ataataggaa gaatgggaat 540 
attaacaact atagattaat aaattgtaat acctcagcca taacacaagc ctgtccaaaa 600 
gtctcgtttg acccaattcc tatacattat tgtgctccag ctggttatgc gcctctaaaa 660 
tgtaataata agaaattcaa tggaatagga ccatgcgata atgtcagcac agtacaatgt 720 
acacatggaa ttaagccagt ggtatcaact caattactgt taaatggtag cctagcagaa 780 
gaagagataa taattagatc tgaaaatctg acaaacaatg tcaaaacaat aatagtacat 840 
cttaatgaat ctatagagat taaatgtaca agacctggca ataatacaag aaagagtgtg 900 
agaataggac caggacaagc attctatgca acaggagaca taataggaga tataagacaa 960 
gcacattgta acattagtaa aaatgaatgg aatacaactt tacaaagggt aagtcaaaaa 1020 
ttacaagaac tcttccctaa tagtacaggg ataaaatttg caccacactc aggaggggac 1080 
ctagaaatta ctacacatag ctttaattgt ggaggagaat ttttctattg caatacaaca 1140 
gacctgttta atagtacata cagtaatggt acatgcacta atggtacatg catgtctaat 1200 
aatacagagc gcatcacact ccaatgcaga ataaaacaaa ttataaacat gtggcaggag 1260 
gtaggacgag caatgtatgc ccctcccatt gcaggaaaca taacatgtag atcaaatatt 1320 
acaggactac tattaacacg tgatggagga gataataata ctgaaacaga gacattcaga 1380 
cctggaggag gagacatgag ggacaattgg agaagtgaat tatataaata caaggtggta 1440 
gaaattaaac cattaggagt agcacccact gctgcaaaaa ggagagtggt ggagagagaa 1500 
aaaagagcag taggaatagg agctgtgttc cttgggttct tgggagcagc aggaagcact 1560 
atgggcgcag catcaataac gctgacggta caggccagac aattattgtc tggtatagtg 1620 
caacagcaaa gtaatttgct gagggctata gaggcgcaac agcatatgtt gcaactcacg 1680 
gtctggggca ttaagcagct ccaggcaaga gtcctggcta tagagagata cctacaggat 1740 
caacagctcc taggactgtg gggctgctct ggaaaactca tctgcaccac taatgtgctt 1800 
tggaactcta gttggagtaa taaaactcaa agtgatattt gggataacat gacctggatg 1860
cagtgggata gggaaattag taattacaca aacacaatat acaggttgct tgaagactcg 1920 
caaagccagc aggaaagaaa tgaaaaagat ttactagcat tggacaggtg gaacaatctg 1980 
tggaattggt ttagcataac aaattggctg tggtatataa aaatattcat aatgatagta 2040 
ggaggcttga taggtttaag aataattttt gctgtgctct ctctagtaaa tagagttagg 2100 
cagggatact cacccttgtc attgcagacc cttatcccaa acccgagggg acccgacagg 2160 
ctcggaggaa tcgaagaaga aggtggagag caagacagca gcagatccat tcgattagtg 2220 
agcggattct tgacacttgc ctgggacgac ctacgaagcc tgtgcctctt ctgctaccac 2280 
cgattgagag acttcatatt aattgtagtg agagcagtgg aacttctggg acacagtagt 2340 
ctcaggggac tgcagagggg gtggggaacc cttaagtatt tggggagtct tgtgcaatat 2400 
tggggtctag agttaaaaaa gagtgctatt aatctgcttg atactatagc aatagcagta 2460
gctgaaggaa cagataggat tctagaattc atacaaaacc tttgtagagg tatccgcaac 2520 
gtacctagaa gaataagaca gggcttcgaa gcagctttgc aataa 2565 

<210> 99 
<211> 1491 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gag_TV2_C_ZAopt 
<400> 99 
atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acaagtggga gaagatccgc 60
ctgcgccccg gcggccgcaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120 
ctggagcgct tcgccgtgaa ccccggcctg ctggagacca gcgacggctg ccgccagatc 180 
atcaagcagc tgcagcccgc cctgcagacc ggcaccgagg agatccgcag cctgttcaac 240 
accgtggcca ccctgtactg cgtgcacaag ggcatcgacg tgcgcgacac caaggaggcc 300 
ctggacaaga tcgaggagga gcagaacaag tgccagcaga agacccagca ggccgaggcc 360 
gccgacaaga aggtgagcca gaactacccc atcgtgcaga acctgcaggg ccagatggtg 420 
caccaggcca tcagcccccg caccctgaac gcctgggtga aggtgatcga ggagaaggcc 480
ttcagccccg aggtgatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540 
ctgaacacca tgctgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600 
accatcaacg aggaggccgc cgagtgggac cgcctgcacc ccgtgcacgc cggccccgtg 660 
gcccccggcc agatgcgcga gccccgcggc agcgacatcg ccggcaccac cagcaccctg 720 
caggagcaga tcgcctggat gaccagcaac ccccccatcc ccgtgggcga catctacaag 780 
cgctggatca tcctgggcct gaacaagatc gtgcgcatgt acagccccgt gagcatcctg 840 
gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900 
ctgcgcgccg agcagagcac ccaggaggtg aagaactgga tgaccgacac cctgctggtg 960 
cagaacgcca accccgactg caagaccatc ctgcgcgccc tgggccccgg cgccagcctg 1020 
gaggagatga tgaccgcctg ccagggcgtg ggcggcccca gccacaaggc ccgcgtgctg 1080 
gccgaggcca tgagccaggc caacaacacc agcgtgatga tccagaagag caacttcaag 1140 
ggcccccgcc gcgccgtgaa gtgcttcaac tgcggccgcg agggccacat cgcccgcaac 1200 
tgccgcgccc cccgcaagcg cggctgctgg aagtgcggca aggagggcca ccagatgaag 1260 
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320 
cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccctgga gcccaccgcc 1380 
ccccccgccg agagcttcaa gttcaaggag acccccaagc aggagcccaa ggaccgcgag 1440 
cccctgacca gcctgaagag cctgttcggc agcgaccccc tgagccagta a 1491 

<210> 100 
<211> 1491 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gag_TV2_C_ZAwt
<400> 100 

atgggtgcga gagcgtcaat attaagaggg ggaaaattag acaaatggga aaaaattagg 60 
ttacggccag gggggagaaa acactatatg ctaaaacacc tagtatgggc aagcagagag 120 
ctggaaagat ttgcagttaa ccctggcctt ttagagacat cagacggatg tagacaaata 180 
ataaaacagc tacaaccagc tcttcagaca ggaacagagg aaattagatc attatttaac 240 
acagtagcaa ctctctattg tgtacataaa gggatagatg tacgagacac caaggaagcc 300 
ttagacaaga tagaggagga acaaaacaaa tgtcagcaaa aaacacagca ggcggaagcg 360 
gctgacaaaa aggtcagtca aaattatcct atagtgcaga acctccaagg gcaaatggta 420 
caccaggcca tatcacctag aaccttgaat gcatgggtaa aagtaataga ggagaaggct 480 
tttagcccag aggtaatacc catgtttaca gcattatcag aaggagccac cccacaagat 540 
ttaaacacca tgttaaatac agtgggggga catcaagcag ccatgcaaat gttaaaagat 600 
accatcaatg aggaggctgc agaatgggat aggttacatc cagtacatgc agggcctgtt 660 
gcaccaggcc agatgagaga accaagggga agtgacatag caggaactac tagtaccctt 720 
caagaacaaa tagcatggat gacaagtaac ccacctatcc cagtagggga catctataaa 780 
aggtggataa ttctggggtt aaataaaata gtaagaatgt acagccctgt cagcatttta 840 
gacataaaac aaggaccaaa ggaacccttt agagactatg tagaccggtt cttcaaaact 900 
ttaagagctg aacaatctac acaagaggta aaaaattgga tgacagacac cttgttagtc 960 
caaaatgcga acccagattg taagaccatt ttaagagcat taggaccagg ggcttcatta 1020 
gaagaaatga tgacagcatg tcagggagtg ggaggaccta gccacaaagc aagagttttg 1080
gctgaggcaa tgagccaagc aaacaataca agtgtaatga tacagaaaag caattttaaa 1140 
ggccctagaa gagctgttaa atgtttcaac tgtggcaggg aagggcacat agccaggaat 1200 
tgcagggccc ctaggaaaag gggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa 1260 
gactgtactg agaggcaggc taatttttta gggaaaattt ggccttccca caaggggagg 1320 
ccagggaatt tccttcagag cagaccagag ccaacagccc caccactaga accaacagcc 1380
ccaccagcag agagcttcaa gttcaaggag actccgaagc aggagccgaa agacagggaa 1440 
cctttaactt ccctcaaatc actctttggc agcgacccct tgtctcaata a 1491 

<210> 101 
<211> 624 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nef_TV2_C_ZAopt 
<400> 101 

atgggcggca agtggagcaa gagcagcatc atcggctggc ccgaggtgcg cgagcgcatc 60 
cgccgcaccc gcagcgccgc cgagggcgtg ggcagcgcca gccaggacct ggagaagcac 120 
ggcgccctga ccaccagcaa caccgcccac aacaacgccg cctgcgcctg gctggaggcc 180 
caggaggagg agggcgaggt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 
acctacaagg ccgccatcga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 
ctgatctaca gcaagaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 
ttcttccccg actggcagaa ctacaccccc ggccccggcg tgcgcttccc cctgaccttc 420 
ggctggtact tcaagctgga gcccgtggac ccccgcgagg tggaggaggc caacgagggc 480 
gagaacaact gcctgctgca ccccatgagc cagcacggca tggaggacga ggaccgcgag 540
gtgctgcgct ggaagttcga cagcaccctg gcccgccgcc acatggcccg cgagctgcac 600 
cccgagtact acaaggactg ctga 624 

<210> 102 
<211> 624 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Nef_TV2_C_ZA_wt
<400> 102

atggggggca agtggtcaaa aagcagtata attggatggc ctgaagtaag agaaagaatc 60
agacgaacta ggtcagcagc agagggagta ggatcagcgt ctcaagactt agagaaacat 120
ggggcactta caaccagcaa cacagcccac aacaatgctg cttgcgcctg gctggaagcg 180
caagaggagg aaggagaagt aggctttcca gtcagacctc aggtaccttt aagaccaatg 240
acttataaag cagcaataga tctcagcttc tttttaaaag aaaagggggg actggaaggg 300
ttaatttact ccaagaaaag gcaagagatc cttgatttgt gggtttataa cacacaaggc 360
ttcttccctg attggcaaaa ctacacaccg ggaccagggg tcagatttcc actgaccttt 420
ggatggtact tcaagctaga gccagtcgat ccaagggaag tagaagaggc caatgaagga 480
gaaaacaact gtttactaca ccctatgagc cagcatggaa tggaggatga agacagagaa 540
gtattaagat ggaagtttga cagtacgcta gcacgcagac acatggcccg cgagctacat 600
ccggagtatt acaaagactg ctga 624

<210> 103 
<211> 3009 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Pol_TV2_C_ZAopt 
<400> 103 

ttcttccgcg agaacctggc cttcccccag ggcgaggccc gcgagttccc cagcgagcag 60 
acccgcgcca acagccccac cacccgcacc aacagcccca ccagccgcga gctgcaggtg 120 
cagggcgaca gcgaggccgg cgccgagcgc cagggcacct tcaacttccc ccagatcacc 180
ctgtggcagc gccccctggt gagcatcaag gtggccggcc agaccaagga ggccctgctg 240 
gacaccggcg ccgacgacac cgtgctggag gagatcaacc tgcccggcaa gtggaagccc 300 
aagatgatcg gcggcatcgg cggcttcatc aaggtgcgcc agtacgacca gatcctgatc 360 
gagatctgcg gcaagcgcgc catcggcacc gtgctggtgg gccccacccc cgcgaacatc 420 
atcggccgca acctgctgac ccagctgggc tgcaccctga acttccccat cagccccatc 480 
gagaccgtgc ccgtgaagct gaagcccggc atggacggcc ccaaggtgaa gcagtggccc 540 
ctgaccgagg agaagatcaa ggccctgacc gagatctgcg aggagatgga gaaggagggc 600 
aagatcacca agatcggccc cgagaacccc tacaacaccc ccgtgttcgc catcaagaag 660 
aaggacagca ccaagtggcg caagctggtg gacttccgcg agctgaacaa gcgcacccag 720 
gacttctggg aggtgcagct gggcatcccc caccccgccg gcctgaagaa gaagaagagc 780 
gtgaccgtgc tggacgtggg cgacgcctac ttcagcgtgc ccctggacga gagcttccgc 840 
aagtacaccg ccttcaccat ccccagcatc aacaacgaga cccccggcat ccgctaccag 900 
tacaacgtgc tgccccaggg ctggaagggc agccccgcca tcttccagag cagcatgacc 960 
cgcatcctgg agcccttccg cacccagaac cccgaggtgg tgatctacca gtacatggac 1020 
gacctgtacg tgggcagcga cctggagatc ggccagcacc gcgccaagat cgaggagctg 1080 
cgcggccacc tgctgaagtg gggcttcacc acccccgaca agaagcacca gaaggagccc 1140 
cccttcctgt ggatgggcta cgagctgcac cccgacaagt ggaccgtgca gcccatccag 1200 
ctgcccgaga aggagagctg gaccgtgaac gacatccaga agctggtggg caagctgaac 1260 
tgggccagcc agatctaccc cggcatcaag gtgcgccagc tgtgcaagct gctgcgcggc 1320 
gccaaggccc tgaccgacat cgtgcccctg accgaggagg ccgagctgga gctggccgag 1380 
aaccgcgaga tcctgaagga gcccgtgcac ggcgtgtact acgaccccag caaggacctg 1440 
atcgccgaga tccagaagca gggcaacgac cagtggacct accagatcta ccaggagccc 1500 
ttcaagaacc tgcgcaccgg caagtacgcc aagatgcgca ccgcccacac caacgacgtg 1560 
aagcagctgg ccgaggccgt gcagaagatc acccaggaga gcatcgtgat ctggggcaag 1620 
acccccaagt tccgcctgcc catccccaag gagacctggg agacctggtg gagcgactac 1680 
tggcaggcca cctggatccc cgagtgggag ttcgtgaaca ccccccccct ggtgaagctg 1740 
tggtaccagc tggagaagga gcccatcgtg ggcgccgaga ccttctacgt ggacggcgcc 1800 
gccaaccgcg agaccaagat cggcaaggcc ggctacgtga ccgacaaggg ccgccagaag 1860 
gtggtgagct tcaccgagac caccaaccag aagaccgagc tgcaggccat ccagctggcc 1920 
ctgcaggaca gcggccccga ggtgaacatc gtgaccgaca gccagtacgc cctgggcatc 1980 
atccaggccc agcccgacaa gagcgagagc gagctggtga gccagatcat cgagcagctg 2040 
atcaagaagg agaaggtgta cctgagctgg gtgcccgccc acaagggcat cggcggcaac 2100 
gagcaggtgg acaagctggt gagcagcggc atccgcaagg tgctgttcct ggacggcatc 2160 
gacaaggccc aggaggagca cgagaagtac cacagcaact ggcgcgccat ggccagcgag 2220 
ttcaacctgc cccccatcgt ggccaaggag atcgtggcca gctgcgacaa gtgccagctg 2280 
aagggcgagg ccatgcacgg ccaggtggac tgcagccccg gcatctggca gctggactgc 2340 
acccacctgg agggcaagat catcctggtg gccgtgcacg tggccagcgg ctacatggag 2400 
gccgaggtga tccccgccga gaccggccag gagaccgcct acttcatcct gaagctggcc 2460 
ggccgctggc ccgtgaaggt gatccacacc gacaacggca gcaacttcac cagcaccgcc 2520 
gtgaaggccg cctgctggtg ggccgacatc cagcgcgagt tcggcatccc ctacaacccc 2580 
cagagccagg gcgtggtgga gagcatgaac aaggagctga agaagatcat cggccaggtg 2640 
cgcgaccagg ccgagcacct gaagaccgcc gtgcagatgg ccgtgttcat ccacaacttc 2700 
aagcgcaagg gcggcatcgg cggctacagc gccggcgagc gcatcatcga catcatcgcc 2760 
agcgacatcc agaccaagga gctgcagaag cagatcatca agatccagaa cttccgcgtg 2820 
tactaccgcg acagccgcga ccccatctgg aagggccccg ccaagctgct gtggaagggc 2880 
gagggcgccg tggtgatcca ggacaacagc gacatcaagg tggtgccccg ccgcaaggcc 2940 
aagatcatca aggactacgg caagcagatg gccggcgccg actgcgtggc cggccgccag 3000 
gacgaggac 3009 

<210> 104 
<211> 3009 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pol_TV2_C_ZAwt
<400> 104

ttttttaggg aaaatttggc cttcccacaa ggggaggcca gggaatttcc ttcagagcag 60
accagagcca acagccccac cactagaacc aacagcccca ccagcagaga gcttcaagtt 120
caaggagact ccgaagcagg agccgaaaga cagggaacct ttaacttccc tcaaatcact 180
ctttggcagc gaccccttgt ctcaataaaa gtagcgggcc aaacaaagga ggctctttta 240
gatacaggag cagatgatac agtactagaa gaaataaact tgccaggaaa atggaaacca 300
aaaatgatag gaggaattgg aggttttatc aaagtaagac agtatgatca aatacttata 360
gaaatttgtg gaaaaagggc tataggtaca gtattagtag gacctacacc tgtcaacata 420
attggaagaa atctgttgac tcagcttgga tgcacactaa attttccaat tagccccatt 480
gaaactgtac cagtaaaatt aaagccagga atggatggcc caaaggttaa acaatggcca 540
ttgacagaag aaaaaataaa agcattaaca gaaatttgtg aggaaatgga gaaggaagga 600
aaaattacaa aaattgggcc tgaaaatcca tataacactc cagtatttgc cataaagaag 660
aaggacagta caaagtggag aaaattagta gatttcaggg aactcaataa aagaactcaa 720
gacttttggg aagtccaatt aggaatacca cacccagcag ggttaaaaaa gaaaaaatca 780
gtgacagtac tggatgtggg agatgcatat ttttcagtcc ctttagatga gagcttcaga 840
aaatatactg cattcaccat acctagtata aacaatgetaa caccagggat tagatatcaa 900
tataatgttc ttccacaggg atggaaagga tcaccagcaa tattccagag tagcatgaca 960
agaatcttag agccctttag aacacaaaac ccagaagtag ttatctatca atatatggat 1020
gacttatatg taggatctga cttagaaata gggcaacata gagcaaaaat agaggagtta 1080
agaggacacc tattgaaatg gggatttacc acaccagaca agaaacatca gaaagaaccc 1140
ccatttcttt ggatggggta tgaactccat cctgacaaat ggacagtaca gcctatacag 1200
ctgccagaaa aggagagctg gactgtcaat gatatacaga agttagtggg aaagttaaac 1260
tgggcaagtc agatttaccc agggattaaa gtaaggcaac tgtgtaaact ccttagggga 1320
gccaaagcac taacagacat agtgccactg actgaagaag cagaattaga attggctgag 1380
aacagggaaa ttctaaaaga accagtacat ggagtatatt atgacccatc aaaagattta 1440
atagctgaaa tacagaaaca ggggaatgac caatggacat atcaaattta ccaagaacca 1500
tttaaaaatc tgagaacagg aaagtatgca aaaatgagga ctgcccacac taatgatgtg 1560
aaacagttag cagaggcagt gcaaaagata acccaggaaa gcatagtaat atggggaaaa 1620
actcctaaat ttagactacc catcccaaaa gaaacatggg agacatggtg gtcagactat 1680
tggcaagcca cctggattcc tgagtgggag tttgtcaata cccctcccct agtaaaattg 1740
tggtaccagc tggaaaaaga acccatagta ggggcagaaa ctttctatgt agatggagca 1800
gccaataggg aaactaaaat aggaaaagca gggtatgtca ctgacaaagg aaggcagaaa 1860
gttgtttcct tcactgaaac aacaaatcag aagactgaat tacaagcaat tcagctagct 1920
ttgcaggatt cagggccaga agtaaacata gtaacagact cacagtatgc attaggaatc 1980
attcaagcac aaccagataa gagtgaatca gaattagtca gtcaaataat agaacagttg 2040
ataaaaaagg aaaaagtcta cctatcatgg gtaccagcac ataaaggaat tggaggaaat 2100
gaacaagtag acaaattagt aagtagtgga atcagaaaag tactgtttct agatggaata 2160
gataaagctc aagaagagca tgaaaaatat cacagcaatt ggagagcaat ggctagtgag 2220
tttaatctgc cacccatagt agcaaaggaa atagtagcca gctgtgataa atgtcagcta 2280
aaaggggaag ccatgcatgg acaagtcgac tgtagtccag gaatatggca attagactgt 2340
acacatttag aaggaaaaat catcctagta gcagtccatg tagccagtgg ctacatggaa 2400
gcagaggtta tcccagcaga aacaggacaa gaaacagcat actttatact aaaattagca 2460
ggaagatggc cagtcaaagt aatacataca gataatggca gtaatttcac cagtaccgca 2520
gttaaggcag cctgttggtg ggcagatatc caacgggaat ttggaattcc ctacaatccc 2580
caaagtcaag gagtagtaga atccatgaat aaagaattaa agaaaatcat agggcaagta 2640
agagatcaag ctgagcacct taagacagca gtacaaatgg cagtattcat tcacaatttt 2700
aaaagaaaag gggggattgg ggggtacagt gcaggggaga gaataataga cataatagca 2760
tcagacatac aaactaaaga attacaaaaa caaattataa aaattcaaaa ttttcgggtt 2820
tattacagag acagcagaga ccctatttgg aaaggaccag ccaaactact ctggaaaggt 2880
gaaggggcag tagtaataca agataatagt gatataaagg tagtaccaag aaggaaagca 2940
aaaatcatta aggactatgg aaaacagatg gcaggtgctg attgtgtggc aggtagacag 3000
gatg^^gat 3009

<210> 105
<211> 75
<212> DNA

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: RevExon1_TV2_C_ZAopt
<400> 105 

atggccggcc gcagcggcga cagcgacgag gccctgctgc aggccatcaa gatcatcaag 60 
atcctgtacc agagc 75 

<210> 106 
<211> 76 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: RevExon1_TV2_C_ZAwt
<400> 106 

atggcaggaa gaagcggaga cagcgacgaa gcgctcctcc aagcaataaa gatcatcaag 60 
atcctctacc aaagca 76 

<210> 107 
<211> 246 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: RevExon2_TV2_C_ZAopt 
<400> 107 

ccctacccca agcccgaggg cacccgccag gcccgccgca accgccgccg ccgctggcgc 60 
gcccgccagc agcagatcca cagcatcagc gagcgcatcc tggacacctg cctgggccgc 120 
cccaccaagc ccgtgcccct gctgctgccc cccatcgagc gcctgcacat caactgcagc 180 
gagagcagcg gcaccagcgg cacccagtag agccagggca ccgccgaggg cgtgggcaac 240 
ccctaa 246 



<210> 108 
<211> 248 
<212> DNA 
<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: RevExon2_TV2_C_ZAwt
<400> 108 

acccttatcc caaacccgag gggacccgac aggctcggag gaatcgaaga agaaggtgga 60 
gagcaagaca gcagcagatc cattcgatta gtgagcggat tcttgacact tgcctgggac 120 
gacctacgaa gcctgtgcct cttctgctac caccgattga gagacttcat attaattgta 180 
gtgagagcag tggaacttct gggacacagt agtctcaggg gactgcagag ggggtgggga 240 
acccttaa 248 

<210> 109 
<211> 216 
<212> DNA 

<213> Artificial Sequence 



<220> 






<223> Description of Artificial Sequence: TatExon1_TV2_C_ZAopt
<400> 109 

atggagccca tcgaccccaa cctggagccc tggaaccacc ccggcagcca gcccaagacc 60 
gcctgcaacg gctgctactg caagcgctgc agctaccact gcctggtgtg cttccagaag 120 
aagggcctgg gcatctacta cggccgcaag aagcgccgcc agcgccgcag cgcccccccc 180
agcaacaagg accaccagga ccccctgccc aagcag 216 

<210> 110 
<211> 216 
<212> DNA

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: TatExon1_TV2_C_ZAwt

<400> 110 

atggagccaa tagatcctaa cctagaaccc tggaaccatc caggaagtca gcctaaaact 60 
gcttgtaatg ggtgttactg taaacgttgc agctatcatt gtctagtttg ctttcagaaa 120 
aaaggcttag gcatttacta tggcaggaag aagcggagac agcgacgaag cgctcctcca 180 
agcaataaag atcatcaaga tcctctacca aagcag 216 

<210> 111 
<211> 90 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: TatExon2_TV2_C_ZAopt 
<400> 111 

cccctgagcc agacccgcgg cgaccccacc ggcagcgagg agagcaagaa gaaggtggag 60 
agcaagaccg ccgccgaccc cttcgactag 90 

<210> 112 
<211> 90 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: TatExon2_TV2_C_ZAwt 
<400> 112 

cccttatccc aaacccgagg ggacccgaca ggctcggagg aatcgaagaa gaaggtggag 60 
agcaagacag cagcagatcc attcgattag 90 

<210> 113 
<211> 579 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vif_TV2_C_ZAopt
<400> 113 

atggagaacc gctggcaggt gctgatcgtg tggcaggtgg accgcatgaa gatccgcacc 60 
tggcacagcc tggtgaagca ccacatgtac gtgagccgcc gcgccgacgg ctggttctac 120 
cgccaccact acgagagccg ccaccccaag gtgagcagcg aggtgcacat ccccctgggc 180
gacgcccgcc tggtgatcaa gacctactgg ggcctgcaga ccggcgagcg cgcctggcac 240
ctgggccacg gcgtgagcat cgagtggcgc ctgcgccgct acagcaccca ggtggacccc 300
gacctgaccg accagctgat ccacatgcac tacttcgact gcttcgccga gagcgccatc 360
cgcaaggcca tcctgggcca gatcgtgagc cccaagtgcg actaccaggc cggccacaac 420
aaggtgggca gcctgcagta cctggccctg accgccctga tcaagcccaa gaagatcaag 480
ccccccctgc ccagcgtgcg caagctggtg gaggaccgct ggaacaagcc ccagaagacc 540 
cgcggccgcc gcggcaacca caccatgaac ggccactag 579 

<210> 114 
<211> 579 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vif_TV2_C_ZAwt 
<400> 114 

atggaaaaca gatggcaggt gctgattgtg tggcaggtag acaggatgaa gattagaaca 60 
tggcacagtt tagtaaagca ccatatgtat gtttcgagga gagctgatgg atggttctac 120 
agacatcatt atgaaagcag acacccaaaa gtaagttcag aagtacacat cccattagga 180 
gatgccaggt tagtaataaa aacatattgg ggtctgcaga caggagaaag agcttggcat 240 
ttgggtcacg gagtctccat agaatggaga ttgagaagat atagcacaca agtagaccct 300 
gacctgacag accaactaat tcatatgcat tattttgatt gttttgcaga atctgccata 360 
aggaaagcca tactaggaca gatagttagc cctaagtgtg actatcaagc aggacataac 420 
aaggtaggat ctctacaata cttggcactg acagcattga taaaaccaaa aaagataaag 480 
ccacctctgc ctagtgttag gaaattagta gaggatagat ggaacaagcc ccagaagacc 540 
aggggccgca gagggaacca tacaatgaat ggacactag 579

<210> 115 
<211> 288 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Vpr_TV2_C_ZAopt



<400> 115 

atggagcagg cccccgagga ccagggcccc cagcgcgagc cctacaacga gtggaccctg 60
gagctgctgg aggagctgaa gcaggaggcc gtgcgccact tcccccgccc ctggctgcac 120
aacctgggcc agcacatcta cgagacctac ggcgacacct ggaccggcgt ggaggccatc 180
atccgcatcc tgcagcagct gctgttcatc cacttccgca tcggctgcca ccacagccgc 240
atcggcatcc tgcgccagcg ccgcgcccgc aacggcgcca accgcagc 288

<210> 116
<211> 288
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Vpr_TV2_C_ZAwt
<400> 116

atggaacaag ccccagaaga ccaggggccg cagagggaac catacaatga atggacacta 60
gagcttttag aagaactcaa gcaggaagct gtcagacact ttcctagacc atggctccat 120
aacttaggac aacatatcta tgaaacctat ggagatactt ggacaggagt tgaagcaata 180
ataagaatcc tgcaacaatt actgtttatt catttcagga ttgggtgcca tcatagcaga 240
ataggcattt tgcgacagag aagagcaaga aatggagcca atagatcc 288



<210> 117 
<211> 261 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vpu_TV2_C_ZAopt



<400> 117 

atgctggacc tgaccgcccg catcgacagc cgcctgggca tcggcgccct gatcgtggcc 60 
ctgatcatcg ccatcatcgt gtggaccatc gtgtacatcg agtaccgcaa gctggtgcgc 120 
cagcgcaaga tcgactggct ggtgaagcgc atccgcgagc gcgccgagga cagcggcaac 180 
gagagcgagg gcgacaccga ggagctgagc accctggtgg acatgggcca cctgcgcctg 240 
ctggacgcca acgacgtgta a 261 

<210> 118 
<211> 261 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Vpu_TV2_C_ZAwt



<400> 118 

atgttagatt taactgcaag aatagattct agattaggaa taggagcatt gatagtagca 60 
ctaatcatag caataatagt gtggaccata gtatatatag aatataggaa attggtaagg 120 
caaaggaaaa tagactggtt agttaaaagg attagggaaa gagcagaaga cagtggcaat 180 
gagagcgagg gggatactga agaattatcg acactggtgg atatggggca tcttaggctt 240 
ttggatgcta atgatgtgta a 261 

<210> 119 
<211> 1473 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: gp120mod.TV1.delV2



<400> 119 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60
ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360
aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420
accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480
atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540
atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600
gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660
aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720
ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780
accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840
aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900
gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960
ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020
ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080
ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140
aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200
cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260
tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320
aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380
tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccaagcgc 1440
cgcgtggtgc agcgcgagaa gcgctaactc gag 1473

<210> 120
<211> 1986
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: gp140mod.TV1.delV2
<400> 120

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 
ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 
aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 
accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 
atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540 
atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600 
gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660 
aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720 
ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780 
accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840 
aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900 
gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960 
ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020 
ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080 
ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140 
aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200 
cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260 
tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320 
aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380 
tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccaagcgc 1440 
cgcgtggtgc agcgcgagaa gcgcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 1500 
ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 1560 
ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 1620 
cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 1680 
gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 1740 
tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 1800 
gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 1860 
aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 1920 
gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatctaa 1980 
ctcgag 1986 

<210> 121 
<211> 1986 
<212> DNA 
<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: gp140mod.TV1.mut7.delV2
<400> 121 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 
ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 
aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 
accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 
atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540 
atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600 
gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660 
aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720 
ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780 
accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840 
aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900 
gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960 
ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020 
ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080 
ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140 
aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200 
cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260 
tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320 
aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380 
tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccatcagc 1440 
agcgtggtgc agagcgagaa gagcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 1500 
ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 1560 
ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 1620 
cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 1680 
gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 1740 
tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 1800 
gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 1860 
aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 1920 
gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatctaa 1980 
ctcgag 1986 



<210> 122 
<211> 2397
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: gp160mod.TV1.delV1V2



<400> 122 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60
ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360
aagctgaccc ccctgtgcgt gggcgccggc aactgcaaca ccagcaccat cacccaggcc 420
tgccccaagg tgagcttcga ccccatcccc atccactact gcgcccccgc cggctacgcc 480
atcctgaagt gcaacaacaa gaccttcaac ggcaccggcc cctgctacaa cgtgagcacc 540 
gtgcagtgca cccacggcat caagcccgtg gtgagcaccc agctgctgct gaacggcagc 600 
ctggccgagg agggcatcat catccgcagc gagaacctga ccgagaacac caagaccatc 660 
atcgtgcacc tgaacgagag cgtggagatc aactgcaccc gccccaacaa caacacccgc 720 
aagagcgtgc gcatcggccc cggccaggcc ttctacgcca ccaacgacgt gatcggcaac 780 
atccgccagg cccactgcaa catcagcacc gaccgctgga acaagaccct gcagcaggtg 840 
atgaagaagc tgggcgagca cttccccaac aagaccatcc agttcaagcc ccacgccggc 900 
ggcgacctgg agatcaccat gcacagcttc aactgccgcg gcgagttctt ctactgcaac 960 
accagcaacc tgttcaacag cacctaccac agcaacaacg gcacctacaa gtacaacggc 1020 
aacagcagca gccccatcac cctgcagtgc aagatcaagc agatcgtgcg catgtggcag 1080 
ggcgtgggcc aggccaccta cgcccccccc atcgccggca acatcacctg ccgcagcaac 1140 
atcaccggca tcctgctgac ccgcgacggc ggcttcaaca ccaccaacaa caccgagacc 1200 
ttccgccccg gcggcggcga catgcgcgac aactggcgca gcgagctgta caagtacaag 1260 
gtggtggaga tcaagcccct gggcatcgcc cccaccaagg ccaagcgccg cgtggtgcag 1320 
cgcgagaagc gcgccgtggg catcggcgcc gtgttcctgg gcttcctggg cgccgccggc 1380 
agcaccatgg gcgccgccag catcaccctg accgtgcagg cccgccagct gctgagcggc 1440 
atcgtgcagc agcagagcaa cctgctgaag gccatcgagg cccagcagca catgctgcag 1500 
ctgaccgtgt ggggcatcaa gcagctgcag gcccgcgtgc tggccatcga gcgctacctg 1560 
aaggaccagc agctgctggg catctggggc tgcagcggcc gcctgatctg caccaccgcc 1620 
gtgccctgga acagcagctg gagcaacaag agcgagaagg acatctggga caacatgacc 1680 
tggatgcagt gggaccgcga gatcagcaac tacaccggcc tgatctacaa cctgctggag 1740 
gacagccaga accagcagga gaagaacgag aaggacctgc tggagctgga caagtggaac 1800 
aacctgtgga actggttcga catcagcaac tggccctggt acatcaagat cttcatcatg 1860 
atcgtgggcg gcctgatcgg cctgcgcatc atcttcgccg tgctgagcat cgtgaaccgc 1920 
gtgcgccagg gctacagccc cctgagcttc cagaccctga cccccagccc ccgcggcctg 1980 
gaccgcctgg gcggcatcga ggaggagggc ggcgagcagg accgcgaccg cagcatccgc 2040 
ctggtgagcg gcttcctgag cctggcctgg gacgacctgc gcaacctgtg cctgttcagc 2100 
taccaccgcc tgcgcgactt catcctgatc gccgtgcgcg ccgtggagct gctgggccac 2160 
agcagcctgc gcggcctgca gcgcggctgg gagatcctga agtacctggg cagcctggtg 2220 
cagtactggg gcctggagct gaagaagagc gccatcagcc tgctggacac catcgccatc 2280 
accgtggccg agggcaccga ccgcatcatc gagctggtgc agcgcatctg ccgcgccatc 2340 
ctgaacatcc cccgccgcat ccgccagggc ttcgaggccg ccctgctgta actcgag 2397 

<210> 123 
<211> 2529 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: gp160mod.TV1.delV2
<400> 123 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 
ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360
aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 
accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 
atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540 
atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600 
gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660 
aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720 
ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780 
accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840 
aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900
gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960 
ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020 
ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080 
ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140 
aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200 
cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260
tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320 
aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380 
tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccaagcgc 1440 
cgcgtggtgc agcgcgagaa gcgcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 1500 
ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 1560 
ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 1620 
cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 1680 
gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 1740 
tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 1800 
gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 1860
aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 1920 
gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatcaag 1980 
atcttcatca tgatcgtggg cggcctgatc ggcctgcgca tcatcttcgc cgtgctgagc 2040
atcgtgaacc gcgtgcgcca gggctacagc cccctgagct tccagaccct gacccccagc 2100 
ccccgcggcc tggaccgcct gggcggcatc gaggaggagg gcggcgagca ggaccgcgac 2160
cgcagcatcc gcctggtgag cggcttcctg agcctggcct gggacgacct gcgcaacctg 2220 
tgcctgttca gctaccaccg cctgcgcgac ttcatcctga tcgccgtgcg cgccgtggag 2280 
ctgctgggcc acagcagcct gcgcggcctg cagcgcggct gggagatcct gaagtacctg 2340 
ggcagcctgg tgcagtactg gggcctggag ctgaagaaga gcgccatcag cctgctggac 2400 
accatcgcca tcaccgtggc cgagggcacc gaccgcatca tcgagctggt gcagcgcatc 2460 
tgccgcgcca tcctgaacat cccccgccgc atccgccagg gcttcgaggc cgccctgctg 2520 
taactcgag 2529

<210> 124 
<211> 2529 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: gp160mod.TV1.mut7.delV2
<400> 124 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60
ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360
aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420
accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480
atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 540
atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 600
gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 660
aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 720
ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 780
accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 840
aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 900
gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 960
ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 1020
ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 1080
ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 1140
aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 1200 
cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 1260 
tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 1320 
aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 1380 
tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccatcagc 1440 
agcgtggtgc agagcgagaa gagcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 1500 
ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 1560 
ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 1620 
cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 1680 
gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 1740 
tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 1800 
gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 1860 
aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 1920 
gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatcaag 1980 
atcttcatca tgatcgtggg cggcctgatc ggcctgcgca tcatcttcgc cgtgctgagc 2040 
atcgtgaacc gcgtgcgcca gggctacagc cccctgagct tccagaccct gacccccagc 2100 
ccccgcggcc tggaccgcct gggcggcatc gaggaggagg gcggcgagca ggaccgcgac 2160 
cgcagcatcc gcctggtgag cggcttcctg agcctggcct gggacgacct gcgcaacctg 2220 
tgcctgttca gctaccaccg cctgcgcgac ttcatcctga tcgccgtgcg cgccgtggag 2280 
ctgctgggcc acagcagcct gcgcggcctg cagcgcggct gggagatcct gaagtacctg 2340 
ggcagcctgg tgcagtactg gggcctggag ctgaagaaga gcgccatcag cctgctggac 2400 
accatcgcca tcaccgtggc cgagggcacc gaccgcatca tcgagctggt gcagcgcatc 2460 
tgccgcgcca tcctgaacat cccccgccgc atccgccagg gcttcgaggc cgccctgctg 2520 
taactcgag 2529 

<210> 125 
<211> 2613 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: gp160mod.TV1.tpal
<400> 125 

gtcgacgcca ccatggatgc aatgaagaga gggctctgct gtgtgctgct gctgtgtgga 60
gcagtcttcg tttcgcccag cgccagcacc gaggacctgt gggtgaccgt gtactacggc 120
gtgcccgtgt ggcgcgacgc caagaccacc ctgttctgcg ccagcgacgc caaggcctac 180
gagaccgagg tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc 240
caggagatcg tgctgggcaa cgtgaccgag aacttcaaca tgtggaagaa cgacatggcc 300
gaccagatgc acgaggacgt gatcagcctg tgggaccaga gcctgaagcc ctgcgtgaag 360
ctgacccccc tgtgcgtgac cctgaactgc accgacacca acgtgaccgg caaccgcacc 420
gtgaccggca acagcaccaa caacaccaac ggcaccggca tctacaacat cgaggagatg 480
aagaactgca gcttcaacgc caccaccgag ctgcgcgaca agaagcacaa ggagtacgcc 540
ctgttctacc gcctggacat cgtgcccctg aacgagaaca gcgacaactt cacctaccgc 600
ctgatcaact gcaacaccag caccatcacc caggcctgcc ccaaggtgag cttcgacccc 660
atccccatcc actactgcgc ccccgccggc tacgccatcc tgaagtgcaa caacaagacc 720
ttcaacggca ccggcccctg ctacaacgtg agcaccgtgc agtgcaccca cggcatcaag 780
cccgtggtga gcacccagct gctgctgaac ggcagcctgg ccgaggaggg catcatcatc 840
cgcagcgaga acctgaccga gaacaccaag accatcatcg tgcacctgaa cgagagcgtg 900
gagatcaact gcacccgccc caacaacaac acccgcaaga gcgtgcgcat cggccccggc 960
caggccttct acgccaccaa cgacgtgatc ggcaacatcc gccaggccca ctgcaacatc 1020
agcaccgacc gctggaacaa gaccctgcag caggtgatga agaagctggg cgagcacttc 1080
cccaacaaga ccatccagtt caagccccac gccggcggcg acctggagat caccatgcac 1140
agcttcaact gccgcggcga gttcttctac tgcaacacca gcaacctgtt caacagcacc 1200
taccacagca acaacggcac ctacaagtac aacggcaaca gcagcagccc catcaccctg 1260
cagtgcaaga tcaagcagat cgtgcgcatg tggcagggcg tgggccaggc cacctacgcc 1320



ccccccatcg ccggcaacat cacctgccgc agcaacatca ccggcatcct gctgacccgc 1380
gacggcggct tcaacaccac caacaacacc gagaccttcc gccccggcgg cggcgacatg 1440 
cgcgacaact ggcgcagcga gctgtacaag tacaaggtgg tggagatcaa gcccctgggc 1500 
atcgccccca ccaaggccaa gcgccgcgtg gtgcagcgcg agaagcgcgc cgtgggcatc 1560 
ggcgccgtgt tcctgggctt cctgggcgcc gccggcagca ccatgggcgc cgccagcatc 1620 
accctgaccg tgcaggcccg ccagctgctg agcggcatcg tgcagcagca gagcaacctg 1680 
ctgaaggcca tcgaggccca gcagcacatg ctgcagctga ccgtgtgggg catcaagcag 1740 
ctgcaggccc gcgtgctggc catcgagcgc tacctgaagg accagcagct gctgggcatc 1800 
tggggctgca gcggccgcct gatctgcacc accgccgtgc cctggaacag cagctggagc 1860 
aacaagagcg agaaggacat ctgggacaac atgacctgga tgcagtggga ccgcgagatc 1920 
agcaactaca ccggcctgat ctacaacctg ctggaggaca gccagaacca gcaggagaag 1980 
aacgagaagg acctgctgga gctggacaag tggaacaacc tgtggaactg gttcgacatc 2040 
agcaactggc cctggtacat caagatcttc atcatgatcg tgggcggcct gatcggcctg 2100 
cgcatcatct tcgccgtgct gagcatcgtg aaccgcgtgc gccagggcta cagccccctg 2160 
agcttccaga ccctgacccc cagcccccgc ggcctggacc gcctgggcgg catcgaggag 2220 
gagggcggcg agcaggaccg cgaccgcagc atccgcctgg tgagcggctt cctgagcctg 2280 
gcctgggacg acctgcgcaa cctgtgcctg ttcagctacc accgcctgcg cgacttcatc 2340 
ctgatcgccg tgcgcgccgt ggagctgctg ggccacagca gcctgcgcgg cctgcagcgc 2400 
ggctgggaga tcctgaagta cctgggcagc ctggtgcagt actggggcct ggagctgaag 2460 
aagagcgcca tcagcctgct ggacaccatc gccatcaccg tggccgaggg caccgaccgc 2520 
atcatcgagc tggtgcagcg catctgccgc gccatcctga acatcccccg ccgcatccgc 2580 
cagggcttcg aggccgccct gctgtaactc gag 2613 

<210> 126 
<211> 2616 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: gp160mod.TV1 
<400> 126 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 
ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 
aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 
accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 
atgaagaact gcagcttcaa cgccaccacc gagctgcgcg acaagaagca caaggagtac 540 
gccctgttct accgcctgga catcgtgccc ctgaacgaga acagcgacaa cttcacctac 600 
cgcctgatca actgcaacac cagcaccatc acccaggcct gccccaaggt gagcttcgac 660 
cccatcccca tccactactg cgcccccgcc ggctacgcca tcctgaagtg caacaacaag 720 
accttcaacg gcaccggccc ctgctacaac gtgagcaccg tgcagtgcac ccacggcatc 780 
aagcccgtgg tgagcaccca gctgctgctg aacggcagcc tggccgagga gggcatcatc 840 
atccgcagcg agaacctgac cgagaacacc aagaccatca tcgtgcacct gaacgagagc 900 
gtggagatca actgcacccg ccccaacaac aacacccgca agagcgtgcg catcggcccc 960 
ggccaggcct tctacgccac caacgacgtg atcggcaaca tccgccaggc ccactgcaac 1020 
atcagcaccg accgctggaa caagaccctg cagcaggtga tgaagaagct gggcgagcac 1080 
ttccccaaca agaccatcca gttcaagccc cacgccggcg gcgacctgga gatcaccatg 1140 
cacagcttca actgccgcgg cgagttcttc tactgcaaca ccagcaacct gttcaacagc 1200 
acctaccaca gcaacaacgg cacctacaag tacaacggca acagcagcag ccccatcacc 1260 
ctgcagtgca agatcaagca gatcgtgcgc atgtggcagg gcgtgggcca ggccacctac 1320 
gcccccccca tcgccggcaa catcacctgc cgcagcaaca tcaccggcat cctgctgacc 1380 
cgcgacggcg gcttcaacac caccaacaac accgagacct tccgccccgg cggcggcgac 1440 
atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500 
ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcgagaagcg cgccgtgggc 1560 
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620 
atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680 
ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740 
cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800 
atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860 
agcaacaaga gcgagaagga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920 
atcagcaact acaccggcct gatctacaac ctgctggagg acagccagaa ccagcaggag 1980 
aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040 
atcagcaact ggccctggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 2100 
ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2160 
ctgagcttcc agaccctgac ccccagcccc cgcggcctgg accgcctggg cggcatcgag 2220 
gaggagggcg gcgagcagga ccgcgaccgc agcatccgcc tggtgagcgg cttcctgagc 2280 
ctggcctggg acgacctgcg caacctgtgc ctgttcagct accaccgcct gcgcgacttc 2340 
atcctgatcg ccgtgcgcgc cgtggagctg ctgggccaca gcagcctgcg cggcctgcag 2400 
cgcggctggg agatcctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2460 
aagaagagcg ccatcagcct gctggacacc atcgccatca ccgtggccga gggcaccgac 2520 
cgcatcatcg agctggtgca gcgcatctgc cgcgccatcc tgaacatccc ccgccgcatc 2580 
cgccagggct tcgaggccgc cctgctgtaa ctcgag 2616 

<210> 127 
<211> 2616 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: gp160mod.TV1.wtLnative 
<400> 127 

gaattcatga gagtgatggg gacacagaag aattgtcaac aatggtggat atggggcatc 60 
ttaggcttct ggatgctaat gatttgtaac accgaggacc tgtgggtgac cgtgtactac 120 
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 
aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 
accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 
atgaagaact gcagcttcaa cgccaccacc gagctgcgcg acaagaagca caaggagtac 540 
gccctgttct accgcctgga catcgtgccc ctgaacgaga acagcgacaa cttcacctac 600 
cgcctgatca actgcaacac cagcaccatc acccaggcct gccccaaggt gagcttcgac 660 
cccatcccca tccactactg cgcccccgcc ggctacgcca tcctgaagtg caacaacaag 720 
accttcaacg gcaccggccc ctgctacaac gtgagcaccg tgcagtgcac ccacggcatc 780 
aagcccgtgg tgagcaccca gctgctgctg aacggcagcc tggccgagga gggcatcatc 840 
atccgcagcg agaacctgac cgagaacacc aagaccatca tcgtgcacct gaacgagagc 900 
gtggagatca actgcacccg ccccaacaac aacacccgca agagcgtgcg catcggcccc 960 
ggccaggcct tctacgccac caacgacgtg atcggcaaca tccgccaggc ccactgcaac 1020 
atcagcaccg accgctggaa caagaccctg cagcaggtga tgaagaagct gggcgagcac 1080 
ttccccaaca agaccatcca gttcaagccc cacgccggcg gcgacctgga gatcaccatg 1140 
cacagcttca actgccgcgg cgagttcttc tactgcaaca ccagcaacct gttcaacagc 1200 
acctaccaca gcaacaacgg cacctacaag tacaacggca acagcagcag ccccatcacc 1260 
ctgcagtgca agatcaagca gatcgtgcgc atgtggcagg gcgtgggcca ggccacctac 1320 
gcccccccca tcgccggcaa catcacctgc cgcagcaaca tcaccggcat cctgctgacc 1380 
cgcgacggcg gcttcaacac caccaacaac accgagacct tccgccccgg cggcggcgac 1440 
atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500 
ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcgagaagcg cgccgtgggc 1560 
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620 
atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680 
ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740 
cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800 
atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860 
agcaacaaga gcgagaagga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920 
atcagcaact acaccggcct gatctacaac ctgctggagg acagccagaa ccagcaggag 1980 
aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040 
atcagcaact ggccctggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 2100 
ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2160 
ctgagcttcc agaccctgac ccccagcccc cgcggcctgg accgcctggg cggcatcgag 2220 
gaggagggcg gcgagcagga ccgcgaccgc agcatccgcc tggtgagcgg cttcctgagc 2280 
ctggcctggg acgacctgcg caacctgtgc ctgttcagct accaccgcct gcgcgacttc 2340 
atcctgatcg ccgtgcgcgc cgtggagctg ctgggccaca gcagcctgcg cggcctgcag 2400 
cgcggctggg agatcctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2460 
aagaagagcg ccatcagcct gctggacacc atcgccatca ccgtggccga gggcaccgac 2520 
cgcatcatcg agctggtgca gcgcatctgc cgcgccatcc tgaacatccc ccgccgcatc 2580 
cgccagggct tcgaggccgc cctgctgtaa ctcgag 2616 

<210> 128 
<211> 2604 
<212> DNA 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Wild-type Env 
gp160 (8_2_ZA) 

<400> 128 

atgagagtga tggggacaca gaagaattgt caacaatggt ggatatgggg catcttaggc 60 
ttctggatgc taatgatttg taacacggag gacttgtggg tcacagtcta ctatggggta 120 
cctgtgtgga gagacgcaaa aactactcta ttctgtgcat cagatgctaa agcatatgag 180 
acagaagtgc ataatgtctg ggctacacat gcctgtgtac ccacagaccc caacccacaa 240 
gaaatagttt tgggaaatgt aacagaaaat tttaatatgt ggaaaaatga catggcagat 300 
cagatgcatg aggatgtaat cagtttatgg gatcaaagcc taaagccatg tgtaaagttg 360 
accccactct gtgtcacttt aaactgtaca gatacaaatg ttacaggtaa tagaactgtt 420 
acaggtaata gtaccaataa tacaaatggt acaggtattt ataacattga agaaatgaaa 480 
aattgctctt tcaatgcaac cacagaatta agagataaga aacataaaga gtatgcactc 540 
ttttatagac ttgatatagt accacttaat gagaatagtg acaactttac atatagatta 600 
ataaattgca atacctcaac cataacacaa gcctgtccaa aggtctcttt tgacccgatt 660 
cctatacatt actgtgctcc agctggttat gcgattctaa agtgtaataa taagacattc 720 
aatgggacag gaccatgtta taatgtcagc acagtacaat gtacacatgg aattaagcca 780 
gtggtatcaa ctcaattact gttaaatggt agtctagcag aagaagggat aataattaga 840 
tctgaaaatt tgacagagaa taccaaaaca ataatagtac accttaatga atctgtagag 900 
attaattgta caagacccaa caataataca agaaaaagtg taaggatagg accaggacaa 960 
gcattctatg caacaaatga tgtaatagga aacataagac aagcacattg taacattagt 1020 
acagatagat ggaacaaaac tttacaacag gtaatgaaaa aattaggaga gcatttccct 1080 
aataaaacaa tacaatttaa accacatgca ggaggggatc tagaaattac aatgcatagc 1140 
tttaattgta gaggagaatt tttctattgt aatacatcaa acctgtttaa tagcacatac 1200 
cactctaata atggtacata caaatacaat ggtaattcaa gctcacccat cacactccaa 1260 
tgtaaaataa aacaaattgt acgcatgtgg caaggggtag gacaagcaac gtatgcccct 1320 
cccattgcag gaaacataac atgtagatca aacatcacag gaatactatt gacacgtgat 1380 
ggaggattta acaccacaaa caacacagag acattcagac ctggaggagg agatatgagg 1440 
gataactgga gaagtgaatt atataaatat aaagtagtag aaattaagcc attgggaata 1500 
gcacccacta aggcaaaaag aagagtggtg cagagagaaa aaagagcagt gggaatagga 1560 
gctgtgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaataacg 1620 
ctgacggtac aggccagaca actgttgtct ggtatagtgc aacagcaaag caatttgctg 1680 
aaggctatag aggcgcaaca gcatatgttg caactcacag tctggggcat taagcagctc 1740 
caggcgagag tcctggctat agaaagatac ctaaaggatc aacagctcct agggatttgg 1800 
ggctgctctg gaagactcat ctgcaccact gctgtgcctt ggaactccag ttggagtaat 1860 
aaatctgaaa aagatatttg ggataacatg acttggatgc agtgggatag agaaattagt 1920 
aattacacag gcttaatata caatttgctt gaagactcgc aaaaccagca ggaaaagaat 1980 
gaaaaagatt tattagaatt ggacaagtgg aacaatctgt ggaattggtt tgacatatca 2040 
aactggccgt ggtatataaa aatattcata atgatagtag gaggcttgat aggtttaaga 2100 
ataatttttg ctgtgctttc tatagtgaat agagttaggc agggatactc acctttgtca 2160 
tttcagaccc ttaccccaag cccgagggga ctcgacaggc tcggaggaat cgaagaagaa 2220 
ggtggagagc aagacagaga cagatccata cgattggtga gcggattctt gtcgcttgcc 2280 
tgggacgatc tgcggaacct gtgcctcttc agctaccacc gcttgagaga cttcatatta 2340 
attgcagtga gggcagtgga acttctggga cacagcagtc tcaggggact acagaggggg 2400 
tgggaaatcc ttaagtatct gggaagtctt gtgcaatatt ggggtctaga gctaaaaaag 2460 
agtgctatta gtctgcttga taccatagca ataacagtag ctgaaggaac agataggatt 2520 
atagaattag tacaaagaat ttgtagagct atcctcaaca tacctagaag aataagacag 2580 
ggctttgaag cagctttgct ataa 2604 

<210> 129 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: wild-type 
amino acid sequence changed by mutation in 
gp120/gp41 cleavage site 

<400> 129 

Lys Arg Arg Val Val Gln Arg Glu Lys Arg 
1 5 10 



<210> 130 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: wild-type 
amino acid sequence changed by mutation in 
gp120/gp41 cleavage site 

<400> 130 

Ile Ser Ser Val Val Gln Ser Glu Lys Ser 
1 5 10 



<210> 131 
<211> 2052 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: gp140mod.TV1.tpa1 
<400> 131 

atggatgcaa tgaagagagg gctctgctgt gtgctgctgc tgtgtggagc agtcttcgtt 60 
tcgcccagcg ccagcaccga ggacctgtgg gtgaccgtgt actacggcgt gcccgtgtgg 120 
cgcgacgcca agaccaccct gttctgcgcc agcgacgcca aggcctacga gaccgaggtg 180 
cacaacgtgt gggccaccca cgcctgcgtg cccaccgacc ccaaccccca ggagatcgtg 240 
ctgggcaacg tgaccgagaa cttcaacatg tggaagaacg acatggccga ccagatgcac 300 
gaggacgtga tcagcctgtg ggaccagagc ctgaagccct gcgtgaagct gacccccctg 360 
tgcgtgaccc tgaactgcac cgacaccaac gtgaccggca accgcaccgt gaccggcaac 420 
agcaccaaca acaccaacgg caccggcatc tacaacatcg aggagatgaa gaactgcagc 480 
ttcaacgcca ccaccgagct gcgcgacaag aagcacaagg agtacgccct gttctaccgc 540 
ctggacatcg tgcccctgaa cgagaacagc gacaacttca cctaccgcct gatcaactgc 600 
aacaccagca ccatcaccca ggcctgcccc aaggtgagct tcgaccccat ccccatccac 660 
tactgcgccc ccgccggcta cgccatcctg aagtgcaaca acaagacctt caacggcacc 720 
ggcccctgct acaacgtgag caccgtgcag tgcacccacg gcatcaagcc cgtggtgagc 780 
acccagctgc tgctgaacgg cagcctggcc gaggagggca tcatcatccg cagcgagaac 840 
ctgaccgaga acaccaagac catcatcgtg cacctgaacg agagcgtgga gatcaactgc 900 
acccgcccca acaacaacac ccgcaagagc gtgcgcatcg gccccggcca ggccttctac 960 
gccaccaacg acgtgatcgg caacatccgc caggcccact gcaacatcag caccgaccgc 1020 
tggaacaaga ccctgcagca ggtgatgaag aagctgggcg agcacttccc caacaagacc 1080 
atccagttca agccccacgc cggcggcgac ctggagatca ccatgcacag cttcaactgc 1140 
cgcggcgagt tcttctactg caacaccagc aacctgttca acagcaccta ccacagcaac 1200 
aacggcacct acaagtacaa cggcaacagc agcagcccca tcaccctgca gtgcaagatc 1260 
aagcagatcg tgcgcatgtg gcagggcgtg ggccaggcca cctacgcccc ccccatcgcc 1320 
ggcaacatca cctgccgcag caacatcacc ggcatcctgc tgacccgcga cggcggcttc 1380 
aacaccacca acaacaccga gaccttccgc cccggcggcg gcgacatgcg cgacaactgg 1440 
cgcagcgagc tgtacaagta caaggtggtg gagatcaagc ccctgggcat cgcccccacc 1500 
aaggccaagc gccgcgtggt gcagcgcgag aagcgcgccg tgggcatcgg cgccgtgttc 1560 
ctgggcttcc tgggcgccgc cggcagcacc atgggcgccg ccagcatcac cctgaccgtg 1620 
caggcccgcc agctgctgag cggcatcgtg cagcagcaga gcaacctgct gaaggccatc 1680 
gaggcccagc agcacatgct gcagctgacc gtgtggggca tcaagcagct gcaggcccgc 1740 
gtgctggcca tcgagcgcta cctgaaggac cagcagctgc tgggcatctg gggctgcagc 1800 
ggccgcctga tctgcaccac cgccgtgccc tggaacagca gctggagcaa caagagcgag 1860 
aaggacatct gggacaacat gacctggatg cagtgggacc gcgagatcag caactacacc 1920 
ggcctgatct acaacctgct ggaggacagc cagaaccagc aggagaagaa cgagaaggac 1980 
ctgctggagc tggacaagtg gaacaacctg tggaactggt tcgacatcag caactggccc 2040 
tggtacatct aa 2052 

<210> 132 
<211> 2073 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: gp140mod.TV1 
<400> 132 

gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 60 
ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 120 
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 
aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 
accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 
atgaagaact gcagcttcaa cgccaccacc gagctgcgcg acaagaagca caaggagtac 540 
gccctgttct accgcctgga catcgtgccc ctgaacgaga acagcgacaa cttcacctac 600 
cgcctgatca actgcaacac cagcaccatc acccaggcct gccccaaggt gagcttcgac 660 
cccatcccca tccactactg cgcccccgcc ggctacgcca tcctgaagtg caacaacaag 720 
accttcaacg gcaccggccc ctgctacaac gtgagcaccg tgcagtgcac ccacggcatc 780 
aagcccgtgg tgagcaccca gctgctgctg aacggcagcc tggccgagga gggcatcatc 840 
atccgcagcg agaacctgac cgagaacacc aagaccatca tcgtgcacct gaacgagagc 900 
gtggagatca actgcacccg ccccaacaac aacacccgca agagcgtgcg catcggcccc 960 
ggccaggcct tctacgccac caacgacgtg atcggcaaca tccgccaggc ccactgcaac 1020 
atcagcaccg accgctggaa caagaccctg cagcaggtga tgaagaagct gggcgagcac 1080 
ttccccaaca agaccatcca gttcaagccc cacgccggcg gcgacctgga gatcaccatg 1140 
cacagcttca actgccgcgg cgagttcttc tactgcaaca ccagcaacct gttcaacagc 1200 
acctaccaca gcaacaacgg cacctacaag tacaacggca acagcagcag ccccatcacc 1260 
ctgcagtgca agatcaagca gatcgtgcgc atgtggcagg gcgtgggcca ggccacctac 1320 
gcccccccca tcgccggcaa catcacctgc cgcagcaaca tcaccggcat cctgctgacc 1380 
cgcgacggcg gcttcaacac caccaacaac accgagacct tccgccccgg cggcggcgac 1440 
atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500 
ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcgagaagcg cgccgtgggc 1560 
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620 
atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680 
ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740 
cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800 
atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860 
agcaacaaga gcgagaagga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920 
atcagcaact acaccggcct gatctacaac ctgctggagg acagccagaa ccagcaggag 1980 
aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040 
atcagcaact ggccctggta catctaactc gag 2073 

<210> 133 
<211> 2073 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: gp140mod.TV1.wtLnative 
<400> 133 

gaattcatga gagtgatggg gacacagaag aattgtcaac aatggtggat atggggcatc 60 
ttaggcttct ggatgctaat gatttgtaac accgaggacc tgtgggtgac cgtgtactac 120 
ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 180 
tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 240 
ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 300 
gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 360 
aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 420 
accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 480 
atgaagaact gcagcttcaa cgccaccacc gagctgcgcg acaagaagca caaggagtac 540 
gccctgttct accgcctgga catcgtgccc ctgaacgaga acagcgacaa cttcacctac 600 
cgcctgatca actgcaacac cagcaccatc acccaggcct gccccaaggt gagcttcgac 660 
cccatcccca tccactactg cgcccccgcc ggctacgcca tcctgaagtg caacaacaag 720 
accttcaacg gcaccggccc ctgctacaac gtgagcaccg tgcagtgcac ccacggcatc 780 
aagcccgtgg tgagcaccca gctgctgctg aacggcagcc tggccgagga gggcatcatc 840 
atccgcagcg agaacctgac cgagaacacc aagaccatca tcgtgcacct gaacgagagc 900 
gtggagatca actgcacccg ccccaacaac aacacccgca agagcgtgcg catcggcccc 960 
ggccaggcct tctacgccac caacgacgtg atcggcaaca tccgccaggc ccactgcaac 1020 
atcagcaccg accgctggaa caagaccctg cagcaggtga tgaagaagct gggcgagcac 1080 
ttccccaaca agaccatcca gttcaagccc cacgccggcg gcgacctgga gatcaccatg 1140 
cacagcttca actgccgcgg cgagttcttc tactgcaaca ccagcaacct gttcaacagc 1200 
acctaccaca gcaacaacgg cacctacaag tacaacggca acagcagcag ccccatcacc 1260 
ctgcagtgca agatcaagca gatcgtgcgc atgtggcagg gcgtgggcca ggccacctac 1320 
gcccccccca tcgccggcaa catcacctgc cgcagcaaca tcaccggcat cctgctgacc 1380 
cgcgacggcg gcttcaacac caccaacaac accgagacct tccgccccgg cggcggcgac 1440 
atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtggagat caagcccctg 1500 
ggcatcgccc ccaccaaggc caagcgccgc gtggtgcagc gcgagaagcg cgccgtgggc 1560 
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1620 
atcaccctga ccgtgcaggc ccgccagctg ctgagcggca tcgtgcagca gcagagcaac 1680 
ctgctgaagg ccatcgaggc ccagcagcac atgctgcagc tgaccgtgtg gggcatcaag 1740 
cagctgcagg cccgcgtgct ggccatcgag cgctacctga aggaccagca gctgctgggc 1800 
atctggggct gcagcggccg cctgatctgc accaccgccg tgccctggaa cagcagctgg 1860 
agcaacaaga gcgagaagga catctgggac aacatgacct ggatgcagtg ggaccgcgag 1920 
atcagcaact acaccggcct gatctacaac ctgctggagg acagccagaa ccagcaggag 1980 
aagaacgaga aggacctgct ggagctggac aagtggaaca acctgtggaa ctggttcgac 2040 
atcagcaact ggccctggta catctaactc gag 2073 

<210> 134 
<211> 624 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: NefD125G_TV2_C_ZAopt 
<400> 134 

atgggcggca agtggagcaa gagcagcatc atcggctggc ccgaggtgcg cgagcgcatc 60 
cgccgcaccc gcagcgccgc cgagggcgtg ggcagcgcca gccaggacct ggagaagcac 120 
ggcgccctga ccaccagcaa caccgcccac aacaacgccg cctgcgcctg gctggaggcc 180 
caggaggagg agggcgaggt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 
acctacaagg ccgccatcga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 
ctgatctaca gcaagaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 
ttcttccccg gctggcagaa ctacaccccc ggccccggcg tgcgcttccc cctgaccttc 420 
ggctggtact tcaagctgga gcccgtggac ccccgcgagg tggaggaggc caacgagggc 480 
gagaacaact gcctgctgca ccccatgagc cagcacggca tggaggacga ggaccgcgag 540 
gtgctgcgct ggaagttcga cagcaccctg gcccgccgcc acatggcccg cgagctgcac 600 
cccgagtact acaaggactg ctga 624 

<210> 135 
<211> 624 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: NefD125G-Myr_TV2_C_ZAopt 
<400> 135 

atggccggca agtggagcaa gagcagcatc atcggctggc ccgaggtgcg cgagcgcatc 60 
cgccgcaccc gcagcgccgc cgagggcgtg ggcagcgcca gccaggacct ggagaagcac 120 
ggcgccctga ccaccagcaa caccgcccac aacaacgccg cctgcgcctg gctggaggcc 180 
caggaggagg agggcgaggt gggcttcccc gtgcgccccc aggtgcccct gcgccccatg 240 
acctacaagg ccgccatcga cctgagcttc ttcctgaagg agaagggcgg cctggagggc 300 
ctgatctaca gcaagaagcg ccaggagatc ctggacctgt gggtgtacaa cacccagggc 360 
ttcttccccg gctggcagaa ctacaccccc ggccccggcg tgcgcttccc cctgaccttc 420 
ggctggtact tcaagctgga gcccgtggac ccccgcgagg tggaggaggc caacgagggc 480 
gagaacaact gcctgctgca ccccatgagc cagcacggca tggaggacga ggaccgcgag 540 
gtgctgcgct ggaagttcga cagcaccctg gcccgccgcc acatggcccg cgagctgcac 600 
cccgagtact acaaggactg ctga 624 

<210> 136 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> WTnative (8_2_TV1_C.ZA) 
<400> 136 

Met Arg Val Met Gly Thr Gln Lys Asn Cys Gln Gln Trp Trp Ile Trp 
1 5 10 15 

Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys 
20 25 



<210> 137 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> WTnative (8_2_TV1_C 
<400> 137 



atgagagtga tggggacaca gaagaattgt caacaatggt ggatatgggg catcttaggc 
ttctggatgc taatgatttg t 



<210> 138 

<211> 27 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> WTmod (8_2_TV1_C.ZA) 

<400> 138 

Met Arg Val Met Gly Thr Gln Lys Asn Cys Gln Gln Trp Trp Ile Trp 
1 5 10 15 

Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys 
20 25 



<210> 139 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> WTmod (8_2_TV1_C.ZA) 
<400> 139 



atgcgcgtga tgggcaccca gaagaactgc cagcagtggt ggatctgggg catcctgggc 
ttctggatgc tgatgatctg c 
<210> 140 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Tpa1 
<400> 140 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 



Ala Val Phe Val Ser Pro Ser Ala Ser 
20 25 



<210> 141 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Tpa1 
<400> 141 

atggatgcaa tgaagagagg gctctgctgt gtgctgctgc tgtgtggagc agtcttcgtt 
tcgcccagcg ccagc 



<210> 142 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Tpa2 
<400> 142 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 



Ala Val Phe Val Ser Pro Ser 
20 



<210> 143 

<211> 69 

<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Tpa2 
<400> 143 

atggatgcaa tgaagagagg gctctgctgt gtgctgctgc tgtgtggagc agtcttcgtt 
tcgcccagc 